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SECRETED PROTEINS AND POLYNUCLEOTIDES ENCODING THEM 

5 This application is a continuation-in-part of the following applications: 

(1) Ser. No. 60/XXX,XXX (converted to a provisional application from non- 
provisional application Ser. No. 08/868,899), fUed June 4, 1997; 

(2) Ser. No, 60/XXX,XXX (converted to a provisional application from non- 
provisional application Ser. No. 08 / 868,898), filed June 4, 1997; 

10 (3) Ser. No. 60/XXX,XXX (converted to a provisional application from non- 

provisional application Ser. No. 08/869,192), filed June 4, 1997; 

(4) Ser. No. 60/XXX,XXX (converted to a provisional application from non- 
provisional application Ser. No. 08/869,191), filed June 4, 1997; 

(5) Ser. No. 60/XXX,XXX (converted to a provisional application from non- 
15 provisional application Ser. No. 08/869,193), filed June 4, 1997; 

(6) Ser. No, 60/XXX,XXX (converted to a provisional application from non- 
- provisional application Ser. No. 08/868,697), filed June 4, 1997; 

(7) Ser. No. 60/XXX,XXX (converted to a provisional application from non- 
provisional application Ser. No. 08/868,698), filed June 4, 1997; 

20 (8) Ser. No. 60/XXX,XXX (converted to a provisional application from non- 

provisional application Ser. No. 08/868,900), filed June 4, 1997; 

(9) Ser. No. 60/XXX,XXX (converted to a provisional application from non- 
provisional application Ser. No, 08/868,696), filed June 4, 1997; 

(10) Ser. No. 60/XXX,XXX (converted to a provisional application from non- 
2 5 provisional application Ser, No. 08/869,194), filed June 4, 1997; 

all of which are incorporated by reference herein. 
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FIELD OF THE I^A^NTIQN 
The present invention provides novel polynucleotides and proteins encoded by 
such polynucleotides, along with therapeutic, diagnostic and research utilities for these 
polynucleotides and proteins. 

5 

BACKGROUND OF THE INVENTION 
Technology aimed at the discovery of protein factors (including e.g., cytokines, 
such as lymphokines, interferons, CSFs and interleukins) has matured rapidly over the 
past decade. The now routine hybridization cloning and expression cloning techniques 

10 clone novel polynucleotides "directly" in the sense that they rely on information directly 
related to the discovered protein (i.e., partial DNA /amino acid sequence of the protein 
in the case of hybridization cloning; activity of the protein in the case of expression 
cloning). More recent "indirect" cloning techmques such as signal sequence cloning, which 
isolates DNA sequences based on the presence of a now well-recognized secretory leader 

15 sequence motif, as well as various FCR-based or low stringency hybridizahon cloning 
techniques, have advanced the state of the art by making available large numbers of 
DNA/amino acid sequences for proteins that are known to have biological activity by 
virtue of their secreted nature in the case of leader sequence cloning, or by virtue of the 
cell or tissue source in the case of PCR*based techniques. It is to these proteins and the 

2 0 polynucleotides encoding them that the present invention is directed. 

SUMMARY OF THE INVENTION 
In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
25 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 266 to nucleotide 1651; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

3 0 NO:l from nucleotide 521 to nucleotide 1651; 

(d) a polynucleotide comprising-the nucleotide sequence of SEQ ID 
NO:l from nucleotide 335 to nucleotide 634; 
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(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone as294_3 deposited under accession 
number ATCC 98444; 

(f) a polynucleotide encoding the full-length protein encoded by the 
5 cDNA insert of clone as294_3 deposited under accession number ATCC 98444; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone as294_3 deposited under accession number 
ATCC 98444; 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
1 0 insert of clone as294_3 deposited under accession number ATCC 98444; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:2; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:2 having biological activity, the fragment 
1 5 comprising eight consecutive amino acids of SEQ ID NO:2; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(I) a polynucleoride which encodes a species homologue of the protein 
of (i) or (j) above ; and 

2 0 (m) a polynucleotide that hybridizes under stringent conditions to any 

one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:l from nucleotide 266 to nucleotide 1651; the nucleotide sequence of SEQ ID NO:l 
from nucleotide 521 to nucleotide 1651; the nucleotide sequence of SEQ ID NO:l from 

2 5 nucleotide 335 to nucleotide 634; the nucleotide sequence of the full-length protein coding 

sequence of clone as294_3 deposited under accession number ATCC 98444; or the 
nucleotide sequence of a mature protein coding sequence of clone as294_3 deposited 
under accession number ATCC 98444. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 

3 0 of clone as294_3 deposited under accession number ATCC 98444. In yet other preferred 

embodiments, -the present invention provides a polynucleotide encoding a protein 
comprising the amino add sequence of SEQ ID NO'.2 from amino acid 1 to amino acid 123. 
In further preferred embodiments, the present invention provides a pol)mudeotide 
encoding a protein comprising a fragment of the amino add sequence of SEQ ID NO:2 
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having biological activity, the fragment preferably comprising eight (more preferably 
twenty, most preferably thirty) consecutive amino acids of 5EQ ID NO:2, or a 
polynucleotide encoding a protein comprising a fragment of the amino acid sequence of 
SEQ ID NO:2 having biological activity, the fragment comprising the amino acid sequence 
from amino acid 226 to amino acid 235 of SEQ ID NO:2. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:l. 

In other embodiments, the present invenhon provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 

(b) the amino acid sequence of SEQ ID NO:2 from amino acid 1 to 
amino acid 123; 

(c) fragments of the amino acid sequence of SEQ ID NO:2 comprising 
eight consecutive amino acids of SEQ ID NO:2; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
as294_3 deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:2 or the amino acid sequence 
of SEQ ID NO:2 from amino acid 1 to amino acid 123. In further preferred embodiments, 
the present invention provides a protein comprising a fragment of the amino acid 
sequence of SEQ ID NO:2 having biological activity, the fragment preferably comprising 
eight (more preferably twenty, most preferably thirty) consecutive amino acids of SEQ ID 
NO:2, or a protein comprising a fragment of the amino acid sequence of SEQ ID 
NO:2having biological activity, the fragment comprising the amino acid sequence from 
amino acid 226 to amino acid 235 of SEQ ID NO:2. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO;3; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 262 to nucleotide 3096; 

(c) a polynucleotide comprising the nucleotide sequence^of SEQ ID 
NO:3 from nucleotide 1118 to nucleotide 1527; 
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(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone aw92_l deposited under accession 
number ATCC 98444; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone aw92_l deposited under accession number ATCC 98444; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone aw92„l deposited under accession number 
ATCC 98444; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone aw92_l deposited under accession number ATCC 98444; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of 5EQ ID NO:4; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:4 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO:4; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide w^hich encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 262 to nucleotide 3096; the nucleotide sequence of SEQ ID NO:3 
from nucleotide 1118 to nucleotide 1527; the nucleotide sequence of the full-length protein 
coding sequence of clone aw92_l deposited under accession number ATCC 98444; or the 
nucleotide sequence of a mature protein coding sequence of clone aw92_l deposited 
under accession number ATCC 98444. In other preferred -embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone aw92_l deposited under accession number ATCC 98444. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:4 from amino acid 287 to amino acid 
422. In further preferred embodiments, the present invention provides a polynucleotide 
encoding a protein comprising a fragment of the amino add sequence of SEQ ID NO:4 
having biological activity, the fragment preferably comprising eight (more preferably 
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twenty, most preferably thirty) consecutive amino acids of SEQ ID NO:4, or a 
polynucleotide encoding a protein comprising a fragment of the amino acid sequence of 
SEQ ID NO:4 having biological activity, the fragment comprising the amino add sequence 
from amino acid 467 to amino acid 476 of SEQ ID NO:4. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:3. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:4; 

(b) the amino acid sequence of SEQ ID NO:4 from amino acid 287 to 
amino acid 422; 

(c) fragments of the amino acid sequence of SEQ ID NO:4 comprising 
eight consecutive amino acids of SEQ ID NO:4; and 

(d) the amino acid sequence encoded by the cDN A insert of clone 
aw92_l deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:4 or the amino acid sequence 
of SEQ ID NO:4 from amino acid 287 to amino acid 422. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
amino add sequence of SEQ ID NO:4 having biological activity, the fragment preferably 
comprising eight (more preferably twenty, most preferably thirty) consecutive amino 
acids of SEQ ID NO:4, or a protein comprising a fragment of the amino add sequence of 
SEQ ID NO:4having biological activity, the fragment comprising the amino add sequence 
from amino acid 467 to amino acid 476 of SEQ ID NO:4. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected- from the group consisting of: 

(a) a polynucleotide comprising the nudeotide sequence of SEQ ID 

NO:5; 

(b) ..a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO-^ from nucleotide 612 to nucleotide 806; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
" NO:5 from nucleotide 744 to nucleotide 806; 



wo 98/55614 



PCTAJS9S/11210 



(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 1 to nucleotide 794; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bd316_2 deposited under accession 
number ATCC 98444; 

(0 a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone bd316_2 deposited under accession number ATCC 98444; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone bd316_2 deposited under accession number 
ATCC 98444; 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone bd316_2 deposited under accession number ATCC 98444; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:6; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:6 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO: 6; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(I) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-{j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 612 to nucleotide 806; the nucleotide sequence of SEQ ID NOS from 
nucleotide 744 to nucleotide 806; the nucleotide sequence of SEQ ID NO:5 from 
- nucleotide 1 to nucleotide 794; the nucleotide sequence of the full-length protein coding 
sequence of clone bd316_2 deposited under accession number ATCC 98444; or the 
nucleotide sequence of a mature protein coding sequence of clone bd316_2 deposited 
under -accession number ATCC 98444. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone bd316_2 deposited under accession number ATCC 98444. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino add sequence of SEQ ID NO:6 from amino acid 1 to amino acid 61. 
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In further preferred embodiments, the present invention provides a polynucleotide 
encoding a protein comprising a fragment of the amino acid sequence of SEQ ID NO:6 
having biological achvity, the fragment preferably comprising eight (more preferably 
twenty, most preferably thirty) consecutive amino adds of SEQ ID NO:6, or a 
5 polynucleohde encoding a protein comprising a fragment of the amino acid sequence of 
SEQ ID NO:6 having biological activity, the fragment comprising the amino acid sequence 
from amino add 27 to amino acid 36 of SEQ ID NO:6. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
ID NO:5. 

10 In other embodiments, the present invention provides a composition comprising 

a protein, wherein said protein comprises an amino add sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:6; 

(b) the amino acid sequence of SEQ ID NO:6 from amino acid 1 to 
15 amino add 61; 

(c) fragments of the amino acid sequence of SEQ ID NO:6 comprising 
eight consecutive amino acids of SEQ ID NO:6; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
bd316_2 deposited under accession number ATCC 98444; 

2 0 the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:6 or the amino acid sequence 
of SEQ ID NO:6 from amino acid 1 to amino acid 61. In further preferred embodiments, 
the present invention provides a protein comprising a fragment of the amino acid 
sequence of SEQ ID NO:6 having biological activity, the fragment preferably comprising 

2 5 eight (more preferably twenty, most preferably thirty) consecutive amino acids of SEQ ID 

NO:6, or a protein comprising a fragment of the amino acid sequence of SEQ ID 
NO:6having biological activity, the fragment comprising the amino acid sequence from 
amino acid 27 to amino acid 36 of SEQ ID NO:6. 

In one embodiment, the present invention provides a composition comprising an 

3 0 isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nudeotide sequence of SEQ ID 

NO:7; 

(b) a polynucleotide comprising the nudeotide sequence of SEQ ID 
NO:7 from nucleotide 7 to nucleotide 300; 
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(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 1 to nucleotide 363; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bkl30_4 deposited under accession 
number ATCC 98444; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone bkl30_4 deposited under accession number ATCC 98444; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone bkl30_4 deposited under accession number 
ATCC 98444; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone bkl30_4 deposited under accession number ATCC 98444; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:8; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:8 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO: 8; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(I) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 7 to nucleotide 300; the nucleotide sequence of SEQ ID NO:7 from 
nucleotide 1 to nucleotide 363; the nucleotide sequence of the full-length protein coding 
sequence of clone bkl30_4 deposited under accession number ATCC 98444; or the 
nucleotide sequence of a mature protein coding sequence of clone bkl30_4 deposited 
under accession number ATCC 98444. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone bkl30_4 deposited under accession number ATCC 98444. In further preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising a fragment of the amino acid sequence of SEQ ID NO:8 having biological 
activity, the fragment preferably comprising eight (more preferably twenty, most 
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preferably thirty) consecutive amino acids of SEQ ID NO:8, or a polynucleotide encoding 
a protein comprising a fragment of the amino acid sequence of SEQ ID NO:8 having 
biological activity, the fragment comprising the amino acid sequence from amino acid 44 
to amino acid 53 of SEQ ID NO:8. 
5 Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 

ID NO:7. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino add sequence selected from the group 
consisting of: 

1 0 (a) the amino acid sequence of SEQ ID NO:8; 

(b) fragments of the amino acid sequence of SEQ ID NO:8 comprising 
eight consecutive amino acids of SEQ ID NO:8; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
bkl30_4 deposited under accession number ATCC 98444; 

15 the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID .NO:8. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:8 having biological activity, the fragment preferably 
comprising eight (more preferably twenty, most preferably thirty) consecutive amino 

2 0 adds of SEQ ID NO:8, or a protein comprising a fragment of the amino acid sequence of 

SEQ ID NO: 8 having biological activity, the fragment comprising the amino acid sequence 
from amino acid 44 to amino acid 53 of SEQ ID NO:8. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of; 
25 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:9; 

(b) a polynudeotide comprising the nudeotide sequence of SEQ ID 
NO:9 from nucleotide 52 to nucleotide 1863; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

3 0 NO:9 from nudeotide 1219 to nucleotide 1863; 

(d) a polynucleotide comprising the nudeotide sequence of SEQ ID 
NO:9 from nucleotide 1099 to nudeotide 1743; 
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(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bvl31_5 deposited under accession 
number ATCC 98444; 

(f) a polynucleotide encoding the full-length protein encoded by the 
5 cDNA insert of clone bvl31_5 deposited under accession number ATCC 98444; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone bvl31_5 deposited under accession number 
ATCC 98444; 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
10 insert of clone bvl31_5 deposited under accession number ATCC 98444; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:10; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:10 having biological activity, the fragment 
15 comprising eight consecutive amino acids of SEQ ID NO:10; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(I) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

2 0 (m) a polynucleotide that hybridizes under stringent conditions to any 

one of the polynucleotides specified in (a)-(j)- 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 52 to nucleotide 1863; the nucleotide sequence of SEQ ID NO;9 from 
nucleotide 1219 to nucleotide 1863; the nucleotide sequence of SEQ ID NO:9 from 

2 5 nucleotide 1099 to nucleotide 1743; the nucleotide sequence of the full-length protein 

coding sequence of clone bvl31_5 deposited under accession number ATCC 98444; or the 
nucleotide sequence of a mature protein coding sequence of clone bvl31_5 deposited 
under accession number ATCC 98444. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 

3 0 of clone bvl31_5 deposited under accession number ATCC 98444. In yet other preferred 

embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino add sequence of SEQ ID NO:10 from amino acid 430 to amino acid 
564. In further preferred embodiments, the present invention provides a polynucleotide 
encoding a protein comprising a fragment of the amino acid sequence of SEQ ID NO:10 
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having biological activity, the fragment preferably comprising eight (more preferably 
twenty, most preferably thirty) consecutive amino acids of SEQ ID NO: 10, or a 
polynucleotide encoding a protein comprising a fragment of the amino acid sequence of 
SEQ ID NO: 10 having biological activity, the fragment comprising the amino acid 
5 sequence from amino acid 297 to amino acid 306 of SEQ ID NO:10. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
ID NO:9. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino add sequence selected from the group 
10 consisting of: 

(a) the amino acid sequence of SEQ ID NO:10; 

(b) the amino acid sequence of SEQ ID NO:10 from amino acid 430 to 
amino acid 564; 

(c) fragments of the amino acid sequence of SEQ ID NO: 10 comprising 
15 eight consecutive amino acids of SEQ ID NO: 10; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
bvl31_5 deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO: 10 or the amino acid sequence 
20 of SEQ ID NO:10 from amino acid 430 to amino acid 564. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:10 having biological activity, the fragment preferably 
comprising eight (more preferably twenty, most preferably thirty) consecutive amino 
acids of SEQ ID NO:10, or a protein comprising a fragment of the amino acid sequence of 

2 5 SEQ ID NO:10having biological activity, the fragment comprising the amino acid 

sequence from amino acid 297 to amino acid 306 of SEQ ID NO:10. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

3 0 NO:ll;- 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l 1 from nucleotide 67 to nucleotide 690r 

(c) a poljmucleotide comprising the nucleotide sequence of SEQ ID 
NO:ll from nucleotide 1 to nucleotide 576; 



12 



wo 98/55614 



PCT/US98/11210 



(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bv227_l deposited under accession 
nxunber ATCC 98444; 

(e) a polynucleotide encoding the full-length protein encoded by the 
5 cDNA insert of clone bv227_l deposited under accession number ATCC 98444; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone bv227_l deposited under accession number 
ATCC 98444; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
10 insert of clone bv227_l deposited under accession number ATCC 98444; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO: 12; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:12 having biological activity, the fragment 

15 comprising eight consecutive amino acids of SEQ ID NO:12; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

20 (l) a polynucleotide that hybridizes under stringent conditions to any 

one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:ll from nucleotide 67 to nucleotide 690; the nucleotide sequence of SEQ ID NO:ll 
from nucleotide 1 to nucleotide 576; the nucleotide sequence of the full-length protein 

2 5 coding sequence of clone bv227_l deposited under accession number ATCC 98444; or the 

nucleotide sequence of a mature protein coding sequence of clone bv227_l deposited 
under accession number ATCC 98444. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone bv227_l deposited under accession number ATCC 98444. In yet other preferred 

3 0 embodiments, the present invention provides a polynucleotide encoding a protein 

comprising the amino acid sequence of SEQ ID NO:12 from amino add 1 to amino acid 
170. In further prreferred embodiments, the present invention provides a polynucleotide 
encoding a protein comprising a fragment of the amino acid sequence of SEQ ID NO:12 
having biological activity, the fragment preferably comprising eight (more preferably 
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twenty, most preferably thirty) consecutive amino acids of SEQ ID NO: 12, or a 
polynucleotide encoding a protein comprising a fragment of the amino acid sequence of 
SEQ ID NO: 12 having biological activity, the fragment comprising the amino acid 
sequence from amino acid 99 to amino acid 108 of SEQ ID NO: 12. 
5 Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 

ID NO:ll. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

1 0 (a) the amino acid sequence of SEQ ID NO: 12; 

(b) the amino acid sequence of SEQ ID NO:12 from amino acid 1 to 
amino acid 170; 

(c) fragments of the amino acid sequence of SEQ ID NO:12 comprising 
eight consecutive amino acids of SEQ ID NO: 12; and 

15 (d) the amino acid sequence encoded by the cDNA insert of clone 

bv227_l deposited under accession number ATCC 98444; 
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO: 12 or the amino add sequence 
of SEQ ID NO:12 from amino acid 1 to amino acid 170. In further preferred embodiments, 

2 0 the present invention provides a protein comprising a fragment of the amino acid 
sequence of SEQ ID NO: 12 having biological activity, the fragment preferably comprising 
eight (more preferably twenty, most preferably thirty) consecutive amino acids of SEQ ID 
NO: 12, or a protein comprising a fragment of the amino acid sequence of SEQ ID 
NO;12having biological activity, the fragment comprising the amino acid sequence from 

2 5 amino acid 99 to amino acid 108 of SEQ ID NO: 12. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO;13; 

3 0 (b) a polynucleotide comprising the -nucleotide sequence of SEQ ID 

NO:13 from nucleotide 657-to nucleotide 1469; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:13 from nucleotide 678 to nucleotide 1103; 
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(d) a polynucleotide comprising the nucleotide sequence of the hill- 
length protein coding sequence of clone cd265_ll deposited under accession 
number ATCC 98444; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone cd265_ll deposited under accession number ATCC 98444; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone cd265_l 1 deposited under accession number 
ATCC 98444; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone cd265_ll deposited under accession number ATCC 98444; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:14; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:14 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO: 14; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:13 from nucleotide 657 to nucleotide 1469; the nucleotide sequence of SEQ ID NO:13 
from nucleotide 678 to nucleotide 1103; the nucleotide sequence of the full-length protein 
coding sequence of clone cd265_ll deposited under accession number ATCC 98444; or the 
nucleotide sequence of a mature protein coding sequence of clone cd265_ll deposited 
under accession number ATCC 98444. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of done cd265_l 1 deposited under accession number ATCC 98444. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
c<5mprising the amino acid sequence of SEQ ID NO:14 from amino acid 8 to amino add- 
149. In further preferred embodiments, the present invention provides a polynucleotide 
encoding a protein comprising a fragment of the amino acid sequence of SEQ ID NO:14 
having biological activity, the fragment preferably comprising eight (more preferably 
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twenty, most preferably thirty) consecutive amino acids of SEQ ID NO:14, or a 
polynucleotide encoding a protein comprising a fragment of the amino acid sequence of 
SEQ ID NO:14 having biological activity, the fragment comprising the amino acid 
sequence from amino acid 130 to amino acid 139 of SEQ ID NO:14. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:13. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:14; 

(b) the amino acid sequence of SEQ ID NO: 14 from amino acid 8 to 
amino acid 149; 

(c) fragments of the amino acid sequence of SEQ ID NO: 14 comprising 
eight consecutive amino acids of SEQ ID NO:14; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
cd265_ll deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO: 14 or the amino acid sequence 
of SEQ ID NO: 14 from amino acid 8 to amino acid 149. In further preferred embodiments, 
the present invention provides a protein comprising a fragment of the amino acid 
sequence of SEQ ID NO:14 having biological activity, the fragment preferably comprising 
eight (more preferably twenty, most preferably thirty) consecutive amino acids of SEQ ID 
NO: 14, or a protein comprising a fragment of the amino acid sequence of SEQ ID 
NO:14having biological activity, the fragment comprising the eunino acid sequence from 
amino acid 130 to amino add 139 of SEQ ID NO:14. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:15; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 261 to nucleotide 896; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 330 to nucleotide 896; 
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(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO;15 from nucleotide 1 to nucleotide 515; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone ej265_4 deposited under accession 
number ATCC 98444; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone ej265_4 deposited under accession number ATCC 98444; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone ej265_4 deposited under accession number 
ATCC 98444; 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
ir\sert of clone ej265_4 deposited under accession number ATCC 98444; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:16; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino add sequence of SEQ ID NO: 16 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO:16; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide that hybridizes \mder stringent conditions to any 
one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NrO:15 from nucleotide 261 to nucleotide 896; the nucleotide sequence of SEQ ID NO:15 
from nucleotide 330 to nucleotide 896; the nucleotide sequence of SEQ ID NO:15 from 
nucleotide 1 to nucleotide 515; the nucleotide sequence of the full-length protein coding 
sequence of clone ej265_4 deposited under accession number ATCC 98444; or the 
nucleotide sequence of a mature protein coding sequence of done ej265_4 deposited under 
accession number ATCC 98444. In other preferred embodiments, the polynucleotide 
encodes the full-length or a mature protein encoded by the cDNA insert of clone ej265_4 
deposited under accession number ATCC 98444. In yet other preferred embodiments, 
the present invention provides a polynucleotide encoding a protein comprising the amino 
acid sequence of SEQ ID NO: 16 from amino add 1 to amino acid 85. In further preferred 
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embodiments, the present invention provides a polynucleotide encoding a protein 
comprising a fragment of the amino add sequence of SEQ ID MO:16 having biological 
activity, the fragment preferably comprising eight (more preferably twenty, most 
preferably thirty) consecutive amino acids of SEQ ID NO: 16, or a polynucleotide encoding 
5 a protein comprising a fragment of the amino acid sequence of SEQ ID NO:16 having 
biological activity, the fragment comprising the amino acid sequence from amino add 101 
to amino acid 110 of SEQ ID NO:16. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:15. 

10 In other embodiments, the present invention provides a composition comprising 

a protein, wherein said protein comprises an amino add sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO: 16; 

(b) the amino acid sequence of SEQ ID NO: 16 from amino add 1 to 
1 5 amino acid 85; 

(c) fragments of the amino add sequence of SEQ ID NO: 16 comprising 
eight consecutive amino acids of SEQ ID NO: 16; and 

(d) the amino acid sequence encoded by the cDNA insert of done 
ej265_4 deposited under accession number ATCC 98444; 

20 the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO: 16 or the amino acid sequence 
of SEQ ID NO:16 from amino acid 1 to amino acid 85. In further preferred embodiments, 
the present invention provides a protein comprising a fragment of the amino acid 
sequence of SEQ ID NO:16 having biological activity, the fragment preferably comprising 

2 5 eight (more preferably twenty, most preferably thirty) consecutive amino adds of SEQ ID 

NO: 16, or a protein comprising a fragment of the amino acid sequence of SEQ ID 
NO:16having biological activity, the fragment comprising the amino acid sequence from 
amino acid 101 to amino acid 110 of SEQ ID NO:16. 

In one embodiment, the present invention provides a composition comprising an 

3 0 isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nudeotide sequence of SEQ. ID 

NO:17; 

(b) a polynudeotide comprising the nudeotide sequence of SEQ ID 
NO:17 from nudeotide 946 to nucleotide 2232; 
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(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 1336 to nucleotide 1853; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone ey29_8 deposited under accession number 

5 ATCC 98444; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone ey29_8 deposited under accession number ATCC 98444; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone ey29_8 deposited under accession number ATCC 

10 98444; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone ey29_8 deposited under accession number ATCC 98444; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:18; 

15 (i) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO: 18 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO:18; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

2 0 (k) a polynucleotide which encodes a species homologue of the protein 

of (h) or (i) above ; and 

(I) a polynucleotide that hybridizes imder stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

Preferably^ such polynucleotide comprises the nucleotide sequence of SEQ ID 

2 5 NO: 17 from nucleotide 946 to nucleotide 2232; the nucleotide sequence of SEQ ID NO:17 

from nucleotide 1336 to nucleotide 1853; the nucleotide sequence of the full-length protein 
coding sequence of clone ey29_8 deposited under accession number ATCC 98444; or the 
nucleotide sequence of a mature protein coding sequence of clone ey29_8 deposited under 
accession number ATCC 98444. In other preferred embodiments, the polynucleotide 

3 0 encodes the full-length or a mature protein encoded by the cDNA insert of clone ey29.8 

deposited under accession number ATCC 98444. In yet other preferred embodiments, 
the present invention provides a polynucleotide encoding a protein comprising the amino 
acid sequence of SEQ ID NO:18 from amino acid 138 to amino acid 302. In further 
preferred embodiments, the present invention provides a polynucleotide encoding a 
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protein comprising a fragment of the amino acid sequence of SEQ ID NO: 18 having 
biological activity, the fragment preferably comprising eight (more preferably twenty, 
most preferably thirty) consecutive amino acids of SEQ ID NO: 18, or a polynucleotide 
encoding a protein comprising a fragment of the amino acid sequence of SEQ ID NO: 18 
having biological activity, the fragment comprising the amino acid sequence from amino 
acid 209 to amino acid 218 of SEQ ID NO:18, 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
ID NO:17. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:18; 

(b) the amino acid sequence of SEQ ID NO:18 from amino acid 138 to 
amino acid 302; 

(c) fragments of the amino add sequence of SEQ ID NO:! 8 comprising 
eight consecutive amino acids of SEQ ID NO:18; and 

(d) the amino acid sequence encoded by the cDNA insert of done 
ey29_8 deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO: 18 or the amino add sequence 
of SEQ ID NO:18 from amino acid 138 to amino acid 302. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
amino add sequence of SEQ ID NO: 18 having biological activity, the fragment preferably 
comprising eight (more preferably twenty, most preferably thirty) consecutive amino 
acids of SEQ ID NO: 18, or a protein comprising a fragment of the amino add sequence of 
SEQ ID NO: 1 Shaving biological activity, the fragment comprising the amino add 
sequence from amino acid 209 to amino acid 218 of SEQ ID NO:18. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nudeotide sequence of SEQ ID 

NO:lQ; 

(b) a polynucleotide comprising the nudeotide sequence of SEQ ID 
NO:19 from nucleotide 2588 to nucleotide 3439; 
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(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:19 from nucleotide 3005 to nucleotide 3502; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone gnill4_10 deposited under accession 

5 number ATCC 98444; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone gmll4_10 deposited under accession number ATCC 98444; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone gmll4_10 deposited under accession number 

10 ATCC 98444; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone gmll4_10 deposited under accession number ATCC 98444; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:20; 

15 (i) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO:20 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO:20; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
{a)-(g) above; 

2 0 (k) a polynucleotide which encodes a species homologue of the protein 

of (h) or (i) above ; and 

(I) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 

2 5 NO:19 from nucleotide 2588 to nucleotide 3439; the nucleotide sequence of SEQ ID NO:19 

from nucleotide 3005 to nucleotide 3502; the nucleotide sequence of the full-length protein 
coding sequence of clone gmll4_10 deposited under accession number ATCC 98444; or 
the nucleotide sequence of a mature protein coding sequence of clone gmll4_10 deposited 
under accession number ATCC 98444. In other preferred embodiments, the 

3 0 polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 

of clone gmll4_10 deposited under accession number ATCC 98444. In yet other 
preferred embodiments^ the present invention provides a polynucleotide encoding a 
protein comprising the amino acid sequence of SEQ ID NO:20 from amino acid 145 to 
amino acid 284. In further preferred embodiments, the present invention provides a 
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polynucleotide encoding a protein comprising a fragment of the amino acid sequence of 
SEQ ID NO'20 having biological activity, the fragment preferably comprising eight (more 
preferably twenty, most preferably thirty) consecutive amino acids of SEQ ID NO:20, or 
a polynucleotide encoding a protein comprising a fragment of the amino acid sequence 
5 of SEQ ID NO:20 having biological activity, the fragment comprising the amino acid 
sequence from amino acid 137 to amino acid 146 of SEQ ID NO:20. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:19. 

In other embodiments, the present invention provides a composition comprising 
10 a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO;20; 

(b) the amino acid sequence of SEQ ID NO:20 from amino acid 145 to 
amino acid 284; 

1 5 (c) fragments of the amino acid sequence of SEQ ID NO:20 comprising 

eight consecutive amino acids of SEQ ID NO:20; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
gmll4_10 deposited under accession number ATCC 98444; 
the protein being substantially free from other mammalian proteins. Preferably such 
2 0 protein comprises the amino acid sequence of SEQ ID NO'-20 or the amino acid sequence 
of SEQ ID NO:20 from amino acid 145 to amino acid 284. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:20 having biological activity, the fragment preferably 
comprising eight (more preferably twenty, most preferably thirty) consecutive amino 

2 5 acids of SEQ ID NO:20, or a protein comprising a fragment of the amino add sequence of 

SEQ ID NO:20having biological achvity, the fragment comprising the amino add 
sequence from amino acid 137 to amino acid 146 of SEQ ID NO:20. 

In certain preferred embodiments, the polynucleotide is operably linked to an 
expression control sequence. The invention also provides a host cell, induding bacterial, 

3 0 yeast, insect and mammalian cells, transformed with such polynucleotide compositions. 

Also provided by the present invention are organisms that have enhanced, reduced, or 
modified expression of the gene(s) corresponding to the polynucleotide sequences 
disdosed herein. - 

Processes are also provided for produdng a protein, which comprise: 
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(a) growing a culture of the host cell transformed with such 
polynucleotide compositions in a suitable culture medium; and 

(b) purifying the protein from the culture. 

The protein produced according to such methods is also provided by the present 
5 invention. 

Protein compositions of the present invention may further comprise a 
pharmaceutically acceptable carrier. Compositions comprising an antibody which 
specifically reacts with such protein are also provided by the present invention. 

Methods are also provided for preventing, treating or ameliorating a medical 
10 condition which comprises administering to a mammalian subject a therapeutically 
effective amount of a composition comprising a protein of the present invention and a 
pharmaceutically acceptable carrier. 

BRIEF DESCRIPTION OF THE DRAWINGS 
1 5 Figures 1 A and IB are schematic representations of the pED6 and pNOTs vectors, 

respectively, used for deposit bf clones disclosed herein. 

DETAILED DESCRIPTION 
ISOLATED PROTEINS AND POLYNUCLEOTIDES 
2 0 Nucleotide and amino acid sequences, as presently determined, are reported 

below for each done and protein disclosed in the present application. The nucleotide 
sequence of each clone can readily be determined by sequencing of the deposited clone 
in accordance with known methods. The predicted amino add sequence (both full-length 
and mature forms) can then be determined from such nucleotide sequence. The amino 

2 5 acid sequence of the protein encoded by a particular clone can also be determined by 

expression of the clone in a suitable host cell collecting the protein and determining its 
sequence- For each disclosed protein applicants have identified what they have 
determined to be the reading frame best idenhfiable with sequence information available 
at the time of filing. 

3 0 As used herein a "secreted" protein is one which, when expressed in a suitable host 

cell, is transported across or through a membrane, induding transport as a result of signal 
sequences in its amino acid sequence. "Secreted" proteins indude without limitation 
proteins secreted wholly (e.g., soluble proteins) or partially (e.g. , receptors) from the cell 
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in which they are expressed. "Secreted" proteins also include without limitation proteins 
which are transported across the membrane of the endoplasmic reticulum. 

Clone "as294 3" 

5 A polynucleotide of the present invention has been identified as clone "as294_3". 

as294_3 was isolated from a human fetal brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the arriino acid sequence of the encoded protein. as294_3 is a full-length clone, 
10 including the entire coding sequence of a secreted protein (also referred to herein as 
"as294_3 protein"). 

The nucleotide sequence of as294_3 as presently determined is reported in SEQ ID 
NO:l- What applicants presently believe to be the proper reading frame and the predicted 
amino acid sequence of the as294_3 protein corresponding to the foregoing nucleotide 

15 sequence is reported in SEQ ID NO:2. Amino adds 73 to 85 are a predicted leader/signal 
sequence, with the predicted mature amino acid sequence beginning at amino acid 86, or 
are a transmembrane domain. Amino acids 102 to 114 are also a possible leader/signal 
sequence, with the predicted mature amino acid sequence beginning at amino acid 115, 
or are a transmembrane domain. 

2 0 The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 

as294_3 should be approximately 1900 bp. 

The nucleotide sequence disclosed herein for as294_3 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. as294_3 demonstrated at least some similarity with sequences 

25 identified as AA206777 (zq80d04,sl Stratagene hNT neuron (#937233) Homo sapiens 
cDNA clone 64791 1 3'), AA206905 (zq80d04.rl Stratagene hNT neuron (#937233) Homo 
sapiens cDNA clone 6479 11 5'), AA280222 (zt04c05.rl NCT_CGAP_GCB 1 Homo sapiens 
cDNA clone IMAGE712136 5'), H19869 (yn57a08.sl Homo sapiens cDNA clone 172502 
30, H24249 (ym50hl2.rl Homo sapiens cDNA clone 52050 5'), N44936 (yy34fll.rl 

30 * Homo sapiens cDNA clone 273165 5'), R15379 (yf90f03.rl Homo sapiens cDNA clone 
29694 5'), R43727 (yg20cll.sl Homo sapiens cDNA clone 32810 '3'), R88673" 
(ym93f09.rl Homo sapiens cDNA clone 166505 5'), T21648 (Human gene signature 
HUMGS03085), T80165 (5p IMAGE clone), and Z99260 (GenPept S. pombe hypothetical 
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protein). The predicted amino acid sequence disclosed herein for as294_3 was searched 
against the GenPept and GeneSeq amino acid sequence databases using the BLASTX 
search protocol. The predicted as294_3 protein demonstrated at least some similarity to 
sequences identified as X73434 (KAP5.4 keratin protein [Ovis aries]) and Z99260 
5 (hypothetical protein [Schizosaccharomyces pombe]). Based upon sequence similarity, 
as294_3 proteins and each similar protein or peptide may share at least some activity. The 
TopPredll computer program predicts three potential transmembrane domains within the 
as294_3 protein sequence, centered around amino acids 105, 228, and 307 of SEQ ID NO:2, 
respectively. 

10 

Clone "aw92 1" 

A polynucleotide of the present invention has been identified as clone "aw92_r'. 
aw92_l was isolated from a cDNA library of human adult ovary (comprising untreated 
tissue and tissue treated with retinoic add and activin), using methods which are selective 
15 for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was identified as 
encoding a secreted or transmembrane protein on the basis of computer analysis of the 
amino acid sequence of the encoded protein. aw92_l is a full-length clone, including the 
entire coding sequence of a secreted protein (also referred to herein as "aw92_l protein"). 
The nucleotide sequence of aw92_l as presently determined is reported in SEQ ID 
2 0 NO:3. What applicants presently believe to be the proper reading frame and the predicted 
amino acid sequence of the aw92_l protein corresponding to the foregoing nucleotide 
sequence is reported in SEQ ID NO:4. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
aw92„l should be approximately 2950 bp. 

2 5 The nucleotide sequence disclosed herein for aw92_l was searched against the 

GenBank and GeneSeq nucleotide sequence databases using BLASTN/ BLASTX and 
FASTA search protocols. aw92_l demonstrated at least some similarity with sequences 
identified as AF021936 (Rattus norvegicus myotonic dystrophy kinase-related Cdc42- 
binding kinase MRCK-beta (MRCK-beta) mRNA, complete CDs, GP2736153), T23529 

3 0 (seq3368 Homo sapiens cDNA clone Hyl8-Charon40-cDNA-247 3'), U59305 (Human 

ser-thr protein kinase PK428 mRNA, complete cds), W16524 (zbl5h09.rl Soares fetal 
lung NbHL19W Homo sapiens cDNA clone 302177 5' similar to PIR A42101 A42101 
protein kinase homolog - human; contains element MER22 repetitive element), and 

25 
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X69292 (H. sapiens mRNA for smooth muscle myosin). The predicted amino acid 
sequence disclosed herein for aw92_l was searched against the GenPept and GeneSeq 
amino acid sequence databases using the BLASTX search protocol. The predicted aw92_l 
protein demonstrated at least some similarity to sequences identified as L03534 
5 (ENHMHCAX_1 myosin heavy chain [Entamoeba histolytica]), R41000 (fluman brain 
cDNA clone C28 protein kinase), U59305 (ser-thr protein kinase PK428 [Homo sapiens]), 
W02258 (Nucieolar/endosomal auto-antigen pl62), and X03740 (myosin heavy chain (876 
AA) [Homo sapiens]). Based upon sequence similarity', aw92_l proteins and each similar 
protein or peptide may share at least some activity. 

10 

Clone'Tjd316 T 

A polynucleotide of the present invention has been idenhficd as clone "bd316_2". 
bd316_2 was isolated from a human fetal kidney cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
15 identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. bd316_2 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"bd316_2 protein"). 

The nucleotide sequence of bd316_2 as presently determined is reported in SEQ 
2 0 ID NO:5. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the bd316_2 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:6. Amino acids 32 to 44 are a predicted 
leader/signal sequience, with the predicted mature amino acid sequence beginning at 
amino acid 45, or are a transmembrane domain. 

2 5 The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 

bd316_2 should be approximately 1200 bp. 

The nucleotide sequence disclosed herein for bd316_2 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using DLASTN/ BLASTX and 
FAST A search protocols. bd316_2 demonstrated at least some similarity with sequences 

3 0 identified as AA234339 (zr72dl2.sl Scares NhHMPu SI Homo sapiens cDNA clone 

668951 3'). L05367 (Human oligodendrocyte myelin glycoprotein (OMG) cxons 1-2; 
neurofibromatosis 1 (NFl) exons 28-49; ecotropic viral integration site 2B (EVI2B) exons 
1-2; ecotropic viral integration site 2 A (EVI2A) exons 1-2; adenylate kinase (AK3) exons 
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1-2). N30778 (yw74h08.sl Homo sapiens cDNA clone 258015 3' similar to 
gblM73048IHUMU3AAAA Human U3 small nuclear RNA (rRNA);contains MERi2.tl 
MER12 repeiitive element), U52195 (Human desmoglein 3 gene, promoter region), 
U60822 (Human dystrophin (DMD) gene, exons 7, 8 and 9, and panial cds), X85184 
5 (R.norvegicus mRNA for ras-related GTPase. ragB), and X90530 (H.sapiens mRNA for 
ragB protein). Based upon sequence similarity, bd316_2 proteins and each similar protein 
or peptide may share at least some activity. The TopPredll computer program predicts 
a potential transmembrane domain within the bd316_2 protein sequence centered around 
amino acid 35 of SEQ ID NO:6. 

10 

Clone "bkl30 4" 

A polynucleotide of the present invention has been identified as clone "bkl30_4". 
bkl30_4 was isolated from a human adult retina cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
15 identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. bkl30_4 is a full-length clone, 
including the enhre coding sequence of a secreted protein (also referred to herein as 
"bkl30_4 protein"). 

The nucleotide sequence of bkl30_4 as presently determined is reported in SEQ 
2 0 ID NO:7. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the bkl3G_4 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:8. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
bkl30_4 should be approximately 550 bp. 

2 5 The nucleotide sequence disclosed herein for bkl30_4 was searched against the 

GenBank and GeneSeq nucleotide sequence databases using BLASTN/BL-'VSTX and 
FAST A search protocols. bkl30_4 demonstrated at least some similarity with sequences 
identified as AA009736 (2e82e04.sl Soares fetal heart NbHH19W Homo sapiens cDNA 
clone 365502 3'), AAl 12971 (zn59b09.rl Stratagene muscle 937209 Homo sapiens cDNA 

3 0 clone 562457 5'), AA 1 96543 (zqOSe 12.s I Stratagene muscle 937209 Homo sapiens cDNA 

clone 629134 3'), AA196677 (zq08el0.rl Stratagene muscle 937209 Homo sapiens cDNA 
clone 629 1 30 5'), AA232667 (zr74e 1 0.s I Soares NhHMPu S 1 Homo sapiens cDNA clone 
669162 3'), H26737 (yll4fl2.rl Homo sapiens cDNA clone 158255 5'), H44642 
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(yp20a08.rl Homo sapiens cDNA clone 187958 5'), and W72771 (zd77cl2,rl Soares fetal 
heart NbHH19W Homo sapiens cDNA clone 346678 5'). The predicted amino acid 
sequence disclosed herein for bkl30_4 was searched against the GenPept and GeneSeq 
amino acid sequence databases using the BLASTX search protocol. The predicted bkl30_4 
protein demonstrated at least some similarity to sequences identified as L 11 647 (glycogen 
branching enzyme [Streptomyces aureofaciens]), L2365l( homology with C. elegans 
cuticle collagen; putative [Caenorhabditis elegans]), W03740 (rchd528 gene product), and 
Z29095 (RlOEl 1.1 [Caenorhabditis elegans]). Based upon sequence similarity, bkl30_4 
proteins and each similar protein or peptide may share at least some activity. 

Clone "bvl31 5" 

A polynucleotide of the present invention has been identified as clone "bvl31_5". 
bvl31_5 was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. bvl31_5 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"bvl31_5 protein"). 

The nucleotide sequence of bvl31_5 as presently determined is reported in SEQ 
ID NO:9, What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the bvl31_5 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:10. Amino acids 377 to 389 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence begirming at 
amino acid 390, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
bvl31_5 should be approximately 2900 bp. 

The nucleotide sequence disclosed herein for bvl31_5 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/ BLASTX and 
FASTA search protocols. bvl31_5 demonstrated at least some similarity with sequences 
identified as AA2335I0 (2r29h03.rl Straiagene NT2 neuronal precursor 937230 Homo 
sapiens cDNA clone 664853 5' similar to TR:GII51007 Gl 151007 ATP DEPENDENT 
PERMEASE), H24176 (ym55e05.rl Homo sapiens cDNA clone 52176 5"), R13832 
(yf65a02.rl Homo sapiens cDNA clone 26986 5' similar to SP:ADP1_YEAST P25371 
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PROBABLE ATP-DEPENDENT PERMEASE), R 16423 (yf40d03.rl Homo sapiens 
cDNA clone 129317 5'), T00880 (Human cispiatin resistance gene cDNA62), T 123 16 
(Replicable and transcriptionally active plasmid), T7887I (yd83b08.sl Homo sapiens 
cDNA clone 114807 3'), U66681 (Human clone EST 157481 ATP-binding cassette 
5 ir^sporter mRN A sequence), and V007 1 0 (Human mitochondrial genes for several tRNAs 
(Phe, Val, Leu) and 12S and i6S ribosomal RNAs). The predicted amino acid sequence 
disclosed herein for bvl31_5 was searched against the GenPept and GeneSeq amino acid 
sequence databases using the BLASTX search protocol. The predicted bvl31_5 protein 
demonstrated at least some similarity to sequences identified as U349 19 (white homolog 

10 [Homo sapiens]), Z48745 (murine ABC8), and Z49821 (putative ABC transponer 
[Saccharomyces cerevisiae]). Based upon sequence similarity, bvl31_5 proteins and each 
similar protein or peptide may share at least some activity. The TopPredll computer 
program predicts five additional potential transmembrane domains within the bvl31_5 
protein sequence, centered around amino acids 354, 439, 463, 494 and 588 of SEQ ID 

15 KO:10, respectively. - - 

Clone "bv227 1" 

A polynucleotide of the present invention has been identified as clone "bv227_l*'. 
bv227_l was isolated from a human adult brain cDNA library using methods which are 
2 0 selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5^36,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. bv227_l is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"bv227_l protein"). 

2 5 The nucleotide sequence of bv227_l as presently determined is reported in SEQ 

ID NO: 11. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the bv227_l protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO: 12. Amino acids 45 to 57 of SEQ ID NO: 12 
are a possible leader /signal sequence, with the predicted mature amino acid sequence 

3 0 beginning at amino add 58, or are a transmembrane domain. Another potential bv227_l 

reading frame and predicted anuno acid sequence is encoded by. basepairs 921 to 2294 of 
SEQ ID NO:ll and is reported in SEQ ID NO:31. A frameshift in the nucleotide sequence 
of SEQ ID NO:ll between about nucleotide 664 to about nucleotide 690 could extend the 



29 



wo 98/55614 PCT/US98/1 1210 



reading frame of SEQ ID NO:31 to form a reading frame extending from position 666 to 
2294 of SEQ ID NQll and encoding the amino acid sequence reported in SEQ ID IMO:32. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing done 
bv227_l should be approximately 3300 bp. 
5 The nucleotide sequence disclosed herein for bv227_l was searched against the 

GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. bv227__l demonstrated at least some similarity with sequences 
identified as AA368932 (EST80282 Placenta I Homo sapiens cDNA similar to similar to 
beta- 1 -glycoprotein PSGGA, pregnancy-specific), D60272 (Human fetal brain cDNA 

10 3 -end GEN-095A07), M58526 (Human alpha-5 collagen t^'pe TV (COL4A5) mRNA, 3' 
end), Q64556 (Human collagen (Type V) coding sequence), R74388 (yi57fl l.sl Homo 
sapiens cDNA clone 143373 3'), and T67066 (Human alpha3(DC) collagen cDNA). The 
predicted amino acid sequences disclosed herein for bv227_l were searched against the 
GenPept and GeneSeq amino acid sequence databases using the BLASTX search protocol. 

15 The predicted bv227_l proteins of SEQ ID NO:31 and SEQ ID NO:32 demonstrated at 
least some similarity to sequences identified as 557 132 (type XVI collagen alpha 1 chain, 
alpha 1 (XVI) [human, placenta. Peptide Partial, 1 186 aa] [Homo sapiens]) and W07539 
(Collagen like protein (CLP)). Based upon sequence similarity, bv227_l proteins and each 
similar protein or peptide may share at least some activity. 

20 ' 

aone"cd265 U" 

A polynxicleotide of the present invention has been identified as clone "cd265_ll*'. 

cd265_ll was isolated from a human fetal brain cDNA library using methods which are 

selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
25 identified as encoding a secreted or transmembrane protein on the basis of computer 

analysis of the amino add sequence of the encoded protein. cd265_ll is a full-length 

clone, including the entire coding sequence of a secreted protein (also referred to herein 

as "cd265_ll protein"). 

The nucleotide sequence of cd265_ll as presently determined is reported in SEQ 
3 0 ID NO:13. What applicants presently believe to be the proper reading frame and the 

predicted amino acid sequence of the cd265_ll protein corresponding to the foregoing 

nucleotide sequence is reported in SEQ ID NO:14. 
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The EcoRI/NotI restriction fragment obtainable from the deposit containing done 
cd265_ll should be approximately 1600 bp. 

The nucleotide sequence disclosed herein for cd265_ll was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
5 FASTA search protocols. cd265_ll demonstrated at least some similarity with sequences 
identified as AA125395 (mp77f05.rl Scares 2NbMT Mus muscuius cDNA clone 575265 
5*), AA 13 1340 (zoOShOl.sl Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens 
cDNA clone 567121 3'), AA244194 (nc06bl l.sl NCLCGAP.Prl Homo sapiens cDNA 
clone 1462), AA339557 (EST44738 Fetal brain I Homo sapiens cDNA 5' end), AA569649 
10 (nf24all.sl NCI_CGAP_Prl Homo sapiens cDNA clone IMAGE:9 14684), and T26052 
(Human gene signature HUMGS08288). Based upon sequence similarity, cd265_ll 
proteins and each similar protein or peptide may share at least some activity. 

Clone "ei265 4" 

15 A polynucleotide of the present invention has been identified as clone "ej265_4", 

ej265_4 was isolated from a human- adult placenta cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (siee U.S. Pat No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. ej265_4 is a full-length clone, 

2 0 including the entire coding sequence of a secreted protein (also referred to herein as 

"ej265„4 protein"). 

The nucleotide sequence of ej265_4 as presently determined is reported in SEQ ID 
NO:15. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the ej265_4 protein corresponding to the foregoing 
25 nucleotide sequence is reported in SEQ ID NO: 16. Amino acids 11 to 23 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 24, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable.from the deposit containing clone 
ej265_4 should be approximately 1200 bp. 

3 0 The nucleotide sequence disclosed herein for ej265_4 was searched against the 

GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. ej265_4 demonstrated at least some similarity with sequences 
identified as D79053 (Human placenta cDNA 5'-end GEN-530B 12), H63 156 (yr50c03.r 1 
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Homo sapiens cDNA clone 208708 5*), H64584 (yul4al2.rl Homo sapiens cDNA clone 
233758 5'), and T49682 (ya78f lO.rl Homo sapiens cDNA clone 678 19 5*). The predicted 
annino acid sequence disclosed herein for ej265„4 was searched against the GenPept and 
GeneSeq amino acid sequence databases using the BLASTX search protocol. The 
5 predicted ej265_4 protein demonstrated at least some similarity to sequences identified 
as endothelial leukocyte adhesion molecule 1, Based upon sequence similarity, ej265_4 
proteins and each similar protein or peptide may share at least some activity. 

Clone "ev29 8" 

10 A polynucleotide of the present invention has been identified as clone "ey29_8". 

ey29_8 was isolated from a human fetal brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. ey29_8 is a full-length clone, 

15 including the entire coding sequence of a secreted protein (also referred to herein as 
"ey29_8 protein"). 

The nucleotide sequence of ey29_8 as presently determined is reported in SEQ ID 
NO: 17. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the ey29„8 protein corresponding to the foregoing 
2 0 nucleotide sequence is reported in SEQ ID NO:18. Annino acids 47 to 59 are a possible 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 60. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
ey29_8 should be approximately 4000 bp. 

2 5 The nucleotide sequence disclosed herein for ey29_8 was searched against the 

GenBank and GeneSeq nucleotide sequence databases using BLASTN/ BLASTX and 
FASTA search protocols. ey29_8 demonstrated at least some similarity with sequences 
identified as AA262521 (zsl7b02.rl Soares NbHTGBC Homo sapiens cDNA clone 
685419 5'), AA429923 (zw66g01 .si Soares testis NHT Homo sapiens cDNA clone 78 1200 

3 0 3'), AA446080 (zw66g03.rl Soares testis NHT Homo sapiens cDNA clone 781204 5'), 

"F07905 (H. sapiens partial cDNA sequence; clone c-21b06), U25125 (Callus gallus 
preprogastrin gene, complete cds), W92743 (zd92g06.sl Soares fetal heart NbHH19W 
Homo sapiens cDNA clone 356986 3'), and Z44092 (H. sapiens partial cDNA sequence; 
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clone c- lsd04). Based upon sequence similarity, ey29_8 proteins and each similar protein 
or peptide may share at least some activity. The TopPredll computer program predicts 
two potential transmembrane domains within the ey29_8 protein sequence, one centered 
around amino acid 120 and another around amino acid 410 of SEQ ID NO:18. 

Clone "gmll4_ 10" 

A polynucleotide of the present invention has been identified as clone "gmll4_10". 
gmll4_10 was isolated from a human adult uterus cDNA library using methods which 
are selective for cDNAs encoding secreted proteins (see U.S. Pat No. 5^36,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. gmll4_10 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as "gmll4_10 protein"). 

The nucleotide sequence of gmll4_10 as presently determined is reported in SEQ 
ID NO:19. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the gmll4_10 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:20. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
gmll4_10 should be approximately 4000 bp. 

The nucleotide sequence disclosed herein for gmll4_10 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. gmll4_10 demonstrated at least some similarity with sequences 
identified as AC002350 (Homo sapiens; HTGS phase 1, 46 unordered pieces). H96041 
(yw61b08.rl Scares placenta 8to9weeks 2NbHP8to9W Homo sapiens cDNA clone 256695 
5'), L02529 (Rattus norvegicus Drosophila polarity gene (frizzled) homologue mRNA, 
complete cds), N70776 (2a72g04.sl Homo sapiens cDNA clone 298134 3'), N96041, 
N92163 (yz89b04.rl Homo sapiens cDNA clone 290191 5'), U20865 (Saccharomyces 
cerevisiae chromosome Xn cosmid 9672), and W93041 (2d93e07.sl Soares fetal heart 
NbHH19W Homo sapiens cDNA clone 357060 3\ The predicted amino acid sequence 
disclosed herein for gmll4_10 was searched against the GenPept and GeneSeq_cimino acid 
sequence databases using the BLASTX search protocol. The predicted gmll4_10 protein 
demonstrated at least some similarity to sequences identified as U20865 (chromosome XU 
cosmid 9672 [Saccharomyces cerevisiae], similar to C. elegans hypothetical protein 
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C34E10.2 (GenBank accession number U 10402)). Based upon sequence similarity, 
gTnll4_10 proteins and each sinnilar protein or peptide may share at least some activity. 
The TopPredll computer program predicts a potential transmembrane domain within the 
gmll4_10 protein sequence centered around amino acid 150 of SEQ ID NO:20- 

. 5 

Deposit of Clones 

Clones as294_3, aw92_l, bd316_2, bkl30_4, bvl31^5, bv227_l, cd265_ll, ej265_4, 
ey29_8, and gmll4_10 were deposited on June 3, 1997 with the American Type Culture 
Collection (10801 University Boulevard, Manassas, Virginia 20110-2209 U.S.A.) as an 

10 original deposit under the Budapest Treaty and were given the accession number ATCC 
98444, from which each clone comprising a particular polynucleotide is obtainable. All 
restrictions on the availability to the public of the deposited material will be irrevocably 
removed upon the granting of the patent, except for the requirements specified in 37 
C.F.R. § 1.808(b), and the term of the deposit will comply with 37 C.F.R. § 1.806. 

1 B Each clone has been transfected into separate bacterial cells (E. coli) in this 

composite deposit. Each clone can be removed from the vector in which it was deposited 
by performing an EcoRI/NotI digestion (5* site, EcoRI; 3' site, NotI) to produce the 
appropriate fragment for such clone. Each clone was deposited in either the pED6 or 
pNOTs vector depicted in Figures lA and IB, respectively. The pED6dpc2 vector 

2 0 ("pED6") was derived from pED6dpcl by insertion of a new polylinker to facilitate 
cDNA cloning (Kaufman et aL, 1991, Nucleic Acids Res. 19: 4485-4490); the pNOTs vector 
was derived from pMT2 (Kaufman et aL, 1989, MoL Cell Biol 9: 946-958) by deletion of 
the DHFR sequences, insertion of a new polylinker, and insertion of the M13 origin of 
replication in the Clal site. In some instances, the deposited clone can become "flipped" 

2 5 (i.e., in the reverse orientation) in the deposited isolate. In such instcinces, the cDNA insert 

can still be isolated by digestion with EcoRI and Notl. However, NotI will then produce 
the 5' site and EcoRI will produce the 3' site for placement of the cDNA in proper 
orientation for expression in a suitable vector. The cDNA may also be expressed from the 
vectors in which they were deposited. 

3 0 Bacterial cells containing a particular clone can- be obtained from the composite 

deposit as follows: 

An oligonucleotide probe or probes should be designed to the sequence that is 
known for that particular clone. This sequence can be derived from the sequences 
provided herein, or from a combination of those sequences. The sequence of an 
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oligonucleotide probe that was used to isolate or to sequence each full-length clone is 
identified below, and should be most reliable in isolating the clone of interest. 



Clone 


Probe Seauence 


as294_3 


SEQ ID NO:21 


aw92_l 


SEQ ID NO:22 


bd316_2 


SEQ ID N'0:23 


bkl30_4 


SEQ ID NO:24 


bvl31_5 


SEQ ID NO:25 


bv227_l 


SEQ ID NO;26 


cd265_ll 


SEQ ID NO:27 


ej265_4 


SEQ ID NO:28 


ey29_8 


SEQ ID NO:29 


gmll4_10 


SEQ ID NO:30 



15 

In the sequences listed above which include an N at position 2, that position is occupied 
in preferred probes/ primers by a biotinylated phosphoaramidite residue rather than a 
nucleotide (such as , for example, that produced by use of biotin phosphoramidite (1- 
dimethoxytrityloxy-2-(N-biotinyl-4-aminobutyl)-propyl-3-0-(2-cyanoethyl)-(N,N- 

2 0 diisopropyl)-phosphoramadite) (Glen Research, cat no. lU-1953)). 

The design of the oligonucleotide probe should preferably follow these 
parameters: 

(a) It should be designed to an area of the sequence which has the fewest 
ambiguous bases ("N's"), if any; 
25 (b) It should be designed to have a of approx. 80 ° C (assuming 2° for each 

A or T and 4 degrees for each G or C). 
The oligonucleotide should preferably be labeled with g-^^P ATP (specific activity 6000 - 
Ci/mmole) and T4 polynucleotide kinase using commonly employed techniques for 
labeling oligonucleotides. Other labeling techniques can also be used. Unincorporated 

3 0 label should preferably be removed by gel filtration chromatography or other established 

methods. The amount of radioactivity incorporated into the probe should be quantitated 
by measurement in a scintillation counter. Preferably, specific activity of the resulting 
probe should be approximately 4e+6 dpm/pmole. 
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The bacterial culture containing the pool of full-length clones should preferably 
be thawed and 100 |al of the stock used to inoculate a sterile culture flask containing 25 ml 
of sterile L-broth containing ampicillin at 100 pg/ml. The culture should preferably be 
grown to saturation at 37°C, and the saturated culture should preferably be diluted in 
5 fresh L-broth. Aliquots of these dilutions should preferably be plated to determine the 
dilution and volume which will yield approximately 5000 distinct and well-separated 
colonies on solid bacteriological media containing L-broth containing ampicillin at 100 
|ig/ml and agar at 1.5% in a 150 mm petri dish when grown overnight at 37°C. Other 
known methods of obtaining distinct, well-separated colonies can also be employed. 

10 Standard colony hybridization procedures should then be used to transfer the 

colonies to nitrocellulose filters and lyse, denature and bake them. 

The filter is then preferably incubated at 65°C for 1 hour with gentle agitation in 
6X SSC (20X stock is 175.3 g NaCl/Uter, 88.2 g Na citrate/ liter, adjusted to pH 7.0 with 
NaOH) containing 0.5% SDS, 100 ^g/ml of yeast RN A, and 10 mM EDTA (approximately 

15 10 mL per 150 mm filter). Preferably, the probe is then added to the hybridization mix at 
a concentration greater than or equal to le-h6 dpm/mL. The filter is then preferably 
incubated at 65°C with gentle agitation overiught. The filter is then preferably washed in 
500 mL of 2X SSC/0.5% SDS at room temperature without agitation, preferably followed 
by 500 mL of 2X SSC/0.1% SDS at room temperature with gentle shaking for 15 minutes. 

2 0 A third wash with O.IX SSC/0-5% SDS at SS^'C for 30 minutes to 1 hour is optional. The 
filter is then preferably dried and subjected to autoradiography for sufficient time to 
visualize the positives on the X-ray film. Other known hybridization methods can also 
be employed. 

The positive colonies are picked, grown in culture, and plasmid DNA isolated 

2 5 using standard procedures. The clones can then be verified by restriction analysis, 

hybridization analysis, or DNA sequencing. 

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the 
protein may be in linear form or they may be cyclized using known methods, for example, 

3 0 as described in H.U. Saragovi, et al. Bio /Technology 10, 773-778 (1992) and in R.S. 

McDowell e-t al, J, Amer. Chem. Soc. 114, 9245-9253 (1992), both of which are incorporated 
herein by reference. -Such fragments may be fused to carrier molecules such, as 
immunoglobulins for many purposes, including increasing the valency of protein binding 
sites. For example, fragments of the protein may be fused through "linker" sequences to 

- 36 - 
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the Fc portion of an immunoglobulin. For a bivalent form of the protein, such a fusion 
could be to the Fc portion of an IgG molecule. Other immunoglobulin isotypes may also 
be used to generate such fusions. For example, a protein - IgM fusion would generate a 
decavalent form of the protein of the invention. 

The present invention also provides both full-length and mature forms of the 
disclosed proteins. The full-length form of the such proteins is identified in the sequence 
listing by translation of the nucleotide sequence of each disclosed clone. The mature 
form(s) of such protein may be obtained by expression of the disclosed full-length 
polynucleotide (preferably those deposited with ATCC) in a suitable mammalian cell or 
other host cell. The sequence(s) of the mature form(s) of the protein may also be 
determinable from the amino acid sequence of the full-length form. 

The present invention also provides genes corresponding to the polynucleotide 
sequences disclosed herein. "Corresponding genes" are the regions of the genome that 
are transcribed to produce the mRNAs from which cDNA polynucleotide sequences are 
derived and may include contiguous regions of the genome necessary for the regulated 
expression of such genes. Corresponding genes may therefore include but are not limited 
to coding sequences, 5' and 3' untranslated regions, alternatively spliced exons, introns, 
promoters, enhancers, and silencer or suppressor elements. The corresponding genes can 
be isolated in accordance with known methods using the sequence information disclosed 
herein. Such methods include the preparation of probes or primers from the disclosed 
sequence information for identification and /or amplification of genes in appropriate 
genomic libraries or other sources of genomic materials. An "isolated gene" is a gene that 
has been separated from the adjacent coding sequences, if any, present in the genome of 
the organism from which the gene was isolated. 

Organisms that have enhanced, reduced, or modified expression of the gene(s) 
corresponding to the polynucleotide sequences disclosed herein are provided. The 
desired change in gene expression can be achieved through the use of antisense 
polynucleotides or ribozymes that bind and /or cleave the mRNA transcribed from the 
gene (Albert and Morris, 1994, Trends PharmacoL Sci. 15(7): 250-254; Lavarosky et aL, 1997, 
Biochem, Mol, Med. 62(1): 11-22; and Hampel, 1998, Prog. NucAeic Acid Res, Mol BioL 58: 1- 
39; all of which are incorporated by reference herein). Transgenic animals that have 
multiple copies of the gene(s) corresponding to the polynucleotide sequences disclosed 
herein, preferably produced by transformation of cells with genetic constructs that are 
stably maintained within the transformed cells and their progeny, are provided. 
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Transgenic animals that have modified genetic control regions that increase or reduce 
gene expression levels, or that change temporal or spatial patterns of gene expression, are 
also provided (see European Patent No. 0 649 464 Bl, incorporated by reference herein). 
In addition, organisms are provided in which the gene(s) corresponding to the 
5 polynucleotide sequences disclosed herein have been partially or completely inactivated, 
through insertion of extraneous sequences into the corresponding gene(s) or through 
deletion of all or part of the corresponding gene(s). Partial or complete gene inactivation 
can be accomplished through insertion, preferably follov^ed by imprecise excision, of 
transposable elements (Plasterk, 1992, Bioessays 14(9): 629-633; Zwaal et ai, 1993, Proc. Natl. 

10 Acad. ScL USA 90(16): 7431-7435; Clark et al, 1994, Proc. Natl Acad. Sci. USA 91(2): 719-722; 
all of which are incorporated by reference herein), or through homologous recombination, 
preferably detected by positive /negative genehc selection strategies (Mansour et aL, 1988, 
Nature 336: 348-352; U.S. Patent Nos. 5,464,764; 5,487,992; 5,627,059; 5,631,153; 5,614, 396; 
5,616,491; and 5,679,523; all of which are incorporated by reference herein). These 

15 organisms with altered gene expression are preferably eukaryotes and more preferably 
are mammals. Such organisms are useful for the development of non-human models for 
the study of disorders involving the corresponding gene(s), and for the development of 
assay systems for the identification of molecules that interact with the protein product(s) 
of the corresponding gene(s). 

2 0 Where the protein of the present invention is membrane-bound (e.g., is a receptor), 

the present invention also provides for soluble forms of such protein. In such forms part 
or all of the intracellular and transmembrane domains of the protein are deleted such that 
the protein is fully secreted from the cell in which it is expressed. The intracellular and 
transmembrane domains of proteins of the invention can be identified in accordance with 

2 5 known techniques for determination of such domains from sequence information. 

Proteins and protein fragments of the present invention include proteins with 
amino add sequence lengths that are at least 25%(more preferably at least 50%, and most 
preferably at least 75%) of the length of a disclosed protein and have at least 60% sequence 
identity (more preferably, at least 75% identity; most preferably at least 90% or 95% 

3 0 identity) with that disclosed protein, where sequence identity is determined by comparing 

the amino acid sequences of the proteiris when aligned so as to maximize overlap and 
identity while minimizing sequence gaps. Also included in the present invention are 
proteins and protein fragments that contain a segment preferably comprising 8 or more 
(more preferably 20 or more, most preferably 30 or more) contiguous amino adds that 
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shares at least 75% sequence identity (more preferably, at least 85% identity; most 
preferably at least 95% identity) with any such segment of any of the disclosed proteins. 

Species homologues of the disclosed polynucleotides and proteins are also 
provided by the present invention. As used herein, a '*spccics homologue" is a protein or 
5 polynucleotide with a different species of origin from that of a given protein or 
polynucleotide, but with significant sequence similarity to the given protein or 
polynucleotide. Preferably, polynucleotide species homologues have at least 60% sequence 
identity (more preferably, at least 75% identity; most preferably at least 90% idenhty) with 
the given polynucleotide, and protein species homologues have at least 30% sequence 

1 0 identity (more preferably, at least 45% identity; most preferably at least 60% identity) with 
the given protein, where sequence identity is determined by comparing the nucleotide 
sequences of the polynucleotides or the amino acid sequences of the proteins when 
aligned so as to maximize overlap and identity while minimizing sequence gaps. Species 
homologues may be isolated and identified by making suitable probes or primers from 

1 5 the sequences provided herein and screening a suitable nucleic acid source from the 
desired species. Preferably, species homologues are those isolated from mammalian 
species. Most preferably, species homologues are those isolated from certain mammalian 
species such as, for example. Pan troglodytes, Gorilla gorilla, Pongo pygmaeus, Hylohates 
concolor, Macaca mtilatta, Papio papio, Papio hamadryas, Cercopithecus aethiops, Cebus capiicinus, 

2 0 Aotiis trivirgatiis, Sangnimis ocdipiis, Microcebus murinus, Mus musculiis, Rattus norvegicus-, 
Cricetuliis griseus, Felis catus, Mustela vison, Canisfamiliaris, Oryctolagus cuniculus. Bos taiirus, 
Ovis aries, Sus scrofa, and Equus caballus, for which genetic maps have been created 
allowing the identification of syntenic relationships between the genomic organization of 
genes in one species and the genomic organization of the related genes in another species 

2 5 (O'Brien and Seuanez, 1988, Ann. Rev. Genet, 22: 323-351; O'Brien et at., 1993, Nature 

Genetics 3:103-112; Johansson ei al, 1995, Genomics 25: 682-690; Lyons et aL, 1997, Nature 
Genetics 15: 47-56; O'Brien et ai, 1997, Trends in Genetics 13(10): 393-399; Carver and Stubbs, 
1997, Genome Research 7:1123-1137; all of which are incorporated by reference herein). 

The invention also encompasses allelic variants of the disclosed polynucleotides 

3 0 or proteins; that is, naturally-occurring alternative forms of the isolated polynucleotides 

which also encode proteins which are identical or have significantly similar sequences to 
those encoded by the disclosed polynucleotides. Preferably, allelic variants have at least 
60% sequence identity (more preferably, at least 75% identity; most preferably at least 90% 
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identity) with the given polynucleotide, where sequence identity is determined by 
comparing the nucleotide sequences of the polynucleotides when aligned so as to maximize 
overlap and identity while minimizing sequence gaps. Allelic variants may be isolated and 
identified by making suitable probes or primers from the sequences provided herein and 
screening a suitable nucleic acid source h-om individuals of the appropriate species. 

The invention also includes polynucleotides with sequences complementary to 
those of the polynucleotides disclosed herein. 

The present irwention also includes poI>Tiucleo tides that hybridize under reduced 
stringency conditions, more preferably stringent conditions, and most preferably highly 
stringent conditions, to polynucleotides described herein. Examples of stringency 
condidons are shown in the table below: highly stringent conditions are those that are at 
least as stringent as, for example, conditions A-F; stringent conditions are at least as 
stringent as, for example, conditions G-L; and reduced stringency conditions are at least 
as stringent as, for example, conditions M-R. 



40 



wo 98/55614 



PCT/US98/11210 



Stringency 
Condition 


Polynucleotide 
Hybrid 


Hybrid 
Length 
(bp)* 


Hybridization Tennperature and 
Buffer^ 


Wash 

Temperature 
and Buffer' 


A 


DNA;DNA 


2 50 


eS^C; IxSSC -or- 

42°C; IxSSC, 50% formamide 


65"C;0.3xSSC 


B 


DNA:DNA 


<50 


Ta*; IxSSC 


Tb*; IxSSC 


C 


DNA:RNA 


i 50 


67 'C; IxSSC -or- 

45'C; IxSSC, 50% formamide 


67'' C; 03xS5C 


D 


DNA:RNA 


<50 


V; IxSSC 


To*; IxSSC 


E 


RNA:RNA 


> 50 


70**C; IxSSC -or- 

SO^'C; IxSSC, 50% formamide 


70°C; 0.3xS5C 


F 


RNA:RNA 


<50 


Tp*; IxSSC 


Tp*; IxSSC 


G 


DNA:DNA 


i 50 


65"C;4xSSC -or- 

42''C; 4xSSC, 50% formamide 


65 °C; IxSSC 


H 


DNA:DNA 


<50 


T„*; 4xS5C 


Th*; 4xS5C 


I 


DNA:RNA 


> 50 


67*'C; 4xS5C -or- 

45°C; 4xSSC, 50% formamide 


67°C; IxSSC 


I 


DNA:RNA 


<50 


T,*; 4xSSC 


T,*; 4xSSC 


K 


RNA:RNA 


i 50 


70**C; 4xSSC -or- 

50 °C; 4xSSC, 50% formamide 


67''C; IxSSC 


L 


RNA:RNA 


<50 


Tt 2xSSC 


T^*; 2xSSC 


M 


DNA:DNA 


2 50 


50 ^C; 4xSSC-or- 

40'C; 6xSSC, 50% formamide 


50**C; 2xSSC 




DNA:DNA 


<50 


T„*; 6xSSC 


T^*; 6xSSC 


O 


DNA:RNA 


? 50 


55 •'C; 4xSSC-or- 

42"C; 6xSSC, 50% formamide 


55°C;2xSSC 


P 


DNA:RNA 


<50 


T/; 6xS5C 


Tp*; 6xSSC 


Q 


RNA:RNA 


i 50 


eO'^C; 4xSSC-or- 

45'C; 6xSSC, 50% formamide 


60"C; 2xSSC 


R 


RNA:RNA 


<:50 


Tr*; 4xSSC 


Tr»; 4xSSC 



^: The hybrid length is that anticipated for the hybridized region(s) of the hybridizing polynucleotides. When 
hybridizing a polynucleotide to a target polynucleotide of unknown sequence, the hybrid length is assumed 
to be that of the hybridizing polynucleotide- When polynucleotides of known sequence are hybridized, the 

2 5 hybrid length can be determined by aligning the sequences of the polynucleotides and identifying the region 

or regions of optimal sequence complementarity. 

S5PE (IxSSPE is 0.15M NaCl, lOmM NaH^PO^, and l,25mM EDTA, pH 7 4) can be substituted for SSC 
{IxSSC is 0.15M NaCl and l5mM sodium citrate) in the hybridization and wash buffers; washes are 
performed for 15 minutes after hybridization is complete. 

3 0 - Tr: The hybridization temperature for hybrids anticipated to be less than 50 base pairs in length should 

be 5-10**C less than the melting temperature (T^J of the hybrid, where T„ is determined according to the 
following equations. For hybrids less than 18 base pairs in length, T^{°C) = 2(# of A + T bases) + 4(* of G + 
Cbases). For hybrids between 18 and 49 base pairs in length, T^CQ = 813 + 16,60og,Q[Na*l) +p.41(%G+C)- 
(600/ N), where N is the number of bases in the hybrid, and |Na*] is the concentration of sodium ions in the 
3 5 hybridization buffer ((Na'] for IxSSC = 0.165 M). 
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Additional examples of stringency conditions for polynucleotide hybridization are 
provided in Sambrook. J., E.F. Fritsch, and T. Maniatis, 1989, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
chapters 9 and 11, and Current Protocols in Molecular Biology, 1995, P.M. Ausubel et al., eds., 
5 John Wiley & Sons, Inc., sections 2.10 and 6.3-6.4, incorporated herein by reference. 

Preferably, each such hybridizing polynucleotide has a length that is at least 
25%(more preferably at least 50%, and most preferably at least 75%) of the length of the 
polynucleotide of the present invention to which it hybridizes, and has at least 60% 
sequence identity (more preferably, at least 75% identity; most preferably at least 90% or 
10 95% identity) with the polynucleotide of the present invention to which it hybridizes, 
where sequence idenhty is determined by comparing the sequences of the hybridizing 
polynucleotides when aligned so as to maximize overlap and identity'' while minimizing 
sequence gaps. 

The isolated polynucleotide of the invention may be operably linked to an 

15 expression control sequence such as the pMT2 or pED expression vectors disclosed in 
Kaufman et aL, Nucleic Acids Res. 19^ 4485-4490 (1991), in order to produce the protein 
recombinantly. Many suitable expression control sequences are known in the art. General 
methods of expressing recombinant proteins are also known and are exemplified in R. 
Kaufman, Methods in Enzymology 185, 537-566 (1990). As defined herein "operably 

2 0 linked" means that the isolated polynucleotide of the invention and an expression control 
sequence are situated within a vector or cell in such a way that the protein is expressed 
by a host cell which has been transformed (transfected) with the ligated 
polynucieotide/expression control sequence. 

A number of types of cells may act as suitable host cells for expression of the 

2 5 protein. Mammalian host cells include, for example, monkey COS cells, Chinese Hamster 
Ovary (CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 
cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell 
strains derived from in vitro culture of primary tissue, primary explants, HeLa cells, 
mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. 

.3 0 Altematively, it may be possible to produce the protein in lower eukaryotes such 

as yeast or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or any 
yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 
strains include Escherichia coli. Bacillus subtilis. Salmonella typhimurium, or any bacterial 
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strain capable of expressing heterologous proteins. If the protein is made in yeast or 
bacteria, it may be necessary to modify the protein produced therein, for example by 
phosphorylation or glycosylation of the appropriate sites, in order to obtain the functional 
protein. Such covalent attachments may be accomplished using known chemical or 
enzymatic methods. 

The protein may also be produced by operably linking the isolated polynucleotide 
of the invention to suitable control sequences in one or more insect expression vectors, 
and employing an insect expression system. Materials and methods for 
baculovirus /insect cell expression systems are commercially available in kit form from, 
e.g., Invitrogen, San Diego, California, U.S.A. (the MaxBadD kit), and such methods are 
well known in the art, as described in Summers and Smith, Texas Aericultural Experiment 
Station Bulletin No. 1555 (1987), incorporated herein by reference. As used herein, an 
insect cell capable of expressing a polynucleotide of the present invention is 
"transformed." 

The protein of the invention may be prepared by culturing transformed host ceUs 
under culture conditions suitable to express the recombinant protein. The resulting 
expressed protein may then be purified from such culture (i.e., from culture medium or 
cell extracts) using known purification processes, such as gel filtration and ion exchange 
chromatography. The purification of the protein may also include an affinity column 
0 containing agents which will bind to the protein; one or more column steps over such 
affinity resins as concanavalin A-agarose, hepann-toyopearl® or Cibacrom blue 3GA 
Sepharose<B); one or more steps involving hydrophobic interaction chromatography using 
such resins as phenyl ether, butyl ether, or propyl ether; or immunoaffinity 
chromatography. 

5 Alternatively, the protein of the invention may also be expressed in a form which 

will facilitate purification. For example, it may be expressed as a fusion protein, such as 
those of maltose binding protein (MBP), glutathione-S-transf erase (GST) or thioredoxin 
(TRX). Kits for expression and purification of such fusion proteins are commercially 
available from New England BioLab (Beverly, MA), Pharmacia (Piscataway, NJ) and 

0 InVitrogen, respectively. The protein can also be tagged with an epitope and 
subsequently purified by using a specific antibody directed to such epitope. One such 
epitope ("Flag") is commercially available from Kodak (New Haven, CT). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- 
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant 
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methyl or other aliphatic groups, can be employed to hirther purify the protein. Some or 
all of the foregoing purification steps/in various combinations, can also be employed to 
provide a substantially homogeneous isolated recombinant protein. The protein thus 
purified is substantially free of other mammalian proteins and is defined in accordance 
5 with the present invention as an "isolated protein." 

The protein of the invention may also be expressed as a product of transgenic 
animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which 
are characterized by somatic or germ cells containing a nucleotide sequence encoding the 
protein. 

10 The protein may also be produced by known conventional chemical synthesis. 

Methods for constructing the proteins of the present invention by synthetic means are 
known to those skilled in the art. The synthehcally-constructed protein sequences, by 
virtue of sharing primary, secondary or tertiary structural and /or conformational 
characterishcs with proteins may possess biological properties in common therewith, 

15 including protein activity. Thus, they may be employed as biologically active or 
immunological substitutes for natural, purified- proteins in screening- of therapeutic 
compounds and in immunological processes for the development of antibodies. 

The proteins provided herein also include proteins characterized by amino acid 
sequences similar to those of purified proteins but into which modification are naturally 

2 0 provided or deliberately engineered. For example, modifications in the peptide or DNA 

sequences can be made by those skilled in the art using known techniques. Modifications 
of interest in the protein sequences may include the alteration, substitution, replacement, 
insertion or deletion of a selected amino acid residue in the coding sequence. For 
example, one or more of the cysteine residues may be deleted or replaced with another 
25 amino acid to alter the conformation of the molecule. Techniques for such alteration, 
substitution, replacement, insertion or deletion are well known to those skilled in the art 
(see, e.g., US, Patent No. 4,518,584). Preferably, such alteration, substitution, replacement, 
insertion or deletion retains the desired activity of the protein. 

Other fragments and derivatives of the sequences of proteins which would be 

3 0 expected to retain protein activity in whole or in part and may thus be useful for screening 

or other immunological methodologies may also be easily made by those skilled in the art 
- given the disclosures herein. Such modifications are believed to be encompassed by the 
present invention. " 
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USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit 
one or more of the uses or biological activities (including those associated with assays 
cited herein) identified below. Uses or activities described for proteins of the present 
invention may be provided by administration or use of such proteins or by administration 
or use of polynucleotides encoding such proteins (such as, for example, in gene therapies 
or vectors suitable for introduction of DNA). 

Research Uses and Utilities 

The polynucleotides provided by the present invention can be used by the research 
community for various purposes. The polynucleotides can be used to express 
recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either 
constitutively or at a particular stage of tissue differentiation or development or in disease 
states); as molecular weight markers on Southern gels; as chromosome markers or tags 
(when labeled) to identify chromosomes or to map related gene positions; to compare 
with endogenous DNA sequences in patients to identify potential genetic disorders; as 
probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" 
known sequences in the process of discovering other novel polynucleotides; for selecting 
and making oligomers for attachment to a "gene chip" or other support, including for 
examination of expression patterns; to raise anti-protein antibodies using DNA 
immunization techniques; and as an antigen to raise anti-DNA antibodies or elicit another 
immune response. Where the polynucleotide encodes a protein which binds or potentially 
binds to another protein (such as, for example, in a receptor-ligand interaction), the 
polynucleotide can also be used in interaction trap assays (such as, for example, those 
described in Gyuris et aL, 1993, Cell 75: 791-803 and in Rossi et al, 1997, Proc. Natl. Acad. 
Sci. USA 94: 8405-8410, all of which are incorporated by reference herein) to identify 
polynucleotides encoding the other protein with which binding occurs or to identify 
inhibitors of the binding interaction. 

The proteins provided by the present invention can similarly be used in assay to 
determine biological activity, including in a panel of multiple proteins for. high- 
throughput screerung; to raise antibodies or to elicit another , immune response; as a 
reagent (including the labeled reagent) in assays designed to quantitatively determine 
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levels of the protein (or its receptor) in biological fluids; as markers for tissues in which 
the corresponding protein is preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or in a disease state); and, of 
course, to isolate correlative receptors or ligands. Where the protein binds or potentially 
5 binds to another protein (such as, for example, in a receptor-ligand interaction), the 
protein can be used to identify the other protein v^ith which binding occurs or to identify 
inhibitors of the binding interaction. Proteins involved in these binding interactions can 
also be used to screen for peptide or small molecule inhibitors or agonists of the binding 
interaction. 

1 0 Any or all of these research utilities are capable of being developed into reagent 

grade or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in 

the art. References disclosing such methods include without limitarion "Molecular 

Cloning: A Laboratory Manual", 2d ed.. Cold Spring Harbor Laboratory Press. Sambrook, 
15 J., E-F. Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to 

Molecular Cloning Techniques", Academic Press, Berger, S.L. and A,R. Kimmel eds., 1987. 

Nutritional Uses 

Polynucleotides and proteins of the present invention can also be used as 

2 0 nutritional sources or supplements. Such uses include without limitation use as a protein 

or amino add supplement, use as a carbon source, use as a rutrogen source and use as a 
source of carbohydrate. In such cases the protein or polynucleotide of the invention can 
be added to the feed of a particular organism or can be administered as a separate solid 
or liquid preparation, such as in the form of powder, pills, solutions, suspensions or 
25 capsules. In the case of microorganisms, the protein or polynucleotide of the invention 
can be added to the medium in or on which the microorganism is cultured. 

Cytokine and Cell Proliferation /Differentiation Activity 

A protein of the present invention may exhibit cytokine, cell proliferation (either 

3 0 inducing or inhibiting) or cell differentiation (either inducing or inhibiting) activity or may 

induce production of other cytokines in certain cell populations. Many protein factors 
discovered to date, including all known cytokines, have exhibited activity in one or more 
factor dependent cell proliferation assays, and hence the assays serve as a conveiuent 
confirmation of cytokine activity. The activity of a protein of the present invention is 
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evidenced by any one of a number of routine factor dependent cell proliferation assays 
for cell lines including, without limitation, 32D, DA2, DAIG, TIO, B9, B9/11. BaF3, 
MC9/G, M+ (preB iM+), 2E8, RB5, DAI, 123, T1165, HT2, CTLL2, TF-l, Mo7e and CMK. 

5 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Assays for T-cell or thymocyte proliferation include without limitation those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. 
Marguiies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- 

1 0 Intersdence (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 
7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; 
Bertagnolli et al., J. Immunol. 145:1706-1712, 1990; Bertagnolli et al„ Cellular Immunology 
133:327-341, 1991; Bertagnolli, et al., J. Immunol. 149:3778-3783, 1992; Bowman et ai., J. 
Immunol. 152: 1756-1761, 1994. 

1 5 Assays for cytokine production and /or proliferation of spleen cells, lymph node 

cells or thymocytes include, without limitation, those described in; Polyclonal T.cell 
stimulation, Kruisbeek, A.M. and Shevach, E.M. In Current Protocols in Immunology. J.E.e.a. 
Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and 
Measurement of mouse and human Interferon y, Schreiber, R.D. In Current Protocols in 

2 0 Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic 
cells include, without limitation, those described in: Measurement of Human and Murine 
Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L.S. and Lipsky, P.E. In Current 
Protocols in Immunology, J.E.e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, 

2 5 Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al.. Nature 

336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sd. U.S.A. 80:2931-2938, 1983; 
Measurement of mouse and human interleukin 6 - Nordan, R. In Current Protocols in 
Immunology, J.E.e.a. Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; 
Smith et aL, Proc. Natl. Acad. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human 

3 0 Lnterleukin 11 - Bennett, P., Giannotti, J., Clark, S.C. and Turner, K. J. In Current Protocols 

in Immunology, J.E.e.a. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991; 
Measurement of mouse and human Interleukin 9 - Ciarletta, A., Giannotti, J., Clark, S.C. 
and Turner, K.J. In Current Protocols in Immunology, J.E.e.a. Coligan eds. Vol 1 pp. 6.13.1, 
John Wiley and Sons, Toronto. 1991 . 
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Assays for T-cell clone responses to antigens (which will identify, among others, 
proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring 
proliferation and cytokine production) include, without limitation, those described in: 
Current Protocols in Immunology, Ed by J. E, Coligan, A.M. Kruisbeek, D.H. Margulies, 
5 E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Interscience 
(Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and 
their cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et ah, 
Proc. Natl. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et aL, Eur. J. Immun. 
11:405-411, 1981; Takai et al., J. Immunol. 137:3494-3500. 1986; Takai et al., J. Inununol. 
10 140:508-512, 1988. 

Immune Stimulating or Suppressing Activity 

A protein of the present invention may also exhibit immune stimulating or 
immune suppressing activity, including without limitation the activihes for which assays 
15 are described herein. A protein may be useful in the treatment of various immune 
deficiencies and disorders (including severe combined immunodeficiency (SCID)), e.g., 
in regulating (up or down) growth and proliferation of T and/or B lymphocytes, as well 
as effecting the cytolytic activity of NK cells and other cell populations. These immune 
deficiencies may be genetic or be caused by viral (e.g., HIV) as well as bacterial or fungal 

2 0 infections, or may result from autoimmune disorders. More specifically, infectious 

diseases causes by viral, bacterial, fungal or other infection may be treatable using a 
protein of the present invention, including infections by HIV, hepatitis viruses, 
herpesviruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections 
such as candidiasis. Of course, in this regard, a protein of the present invention may also 
25 be useful where a boost to the immune system generally may be desirable, i.e., in the 
treatment of cancer. 

Autoimmune disorders which may be treated using a protein of the present 
invention include, for example, connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, 

3 0 Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, 

myasthenia gravis, graft-versus-host disease and autoimmune inflammatory eye disease. 
Such a protein of the present invention may also to be useful in the treatment of allergic 
reactions and conditions, such as asthma (particularly allergic asthma) or other respiratory 
problems. Other conditions, in which immune suppression is desired (including, for 
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example, organ transplantation), may also be treatable using a protein of the present 
invention. 

Using the proteins of the invention it may also be possible to immune responses, 
in a number of vs^ays. Down regulation may be in the form of inhibiting or blocking an 
5 immune response already in progress or may involve preventing the induction of an 
immune response. The functions of activated T cells may be inhibited by suppressing T 
cell responses or by inducing specific tolerance in T cells, or both. Immunosuppression 
of T cell responses is generally an active, non-antigen-specific, process v^hich requires 
continuous exposure of the T cells to the suppressive agent. Tolerance, which involves 
10 inducing non-responsiveness or anergy in T cells, is distinguishable from 
immunosuppression in that it is generally antigen-specific and persists after exposure to 
the tolerizing agent has ceased. Operationally, tolerance can be demonstrated by the lack 
of a T cell response upon reexposure to specific antigen in the absence of the tolerizing 
agent. 

1 5 Down regulating or preventing one or more antigen functions (including without 

limitation B lymphocyte antigen functions (such as , for example, B7)), e.g., preventing 
high level lymphokine synthesis by activated T cells, will be useful in situations of tissue, 
skin and organ transplantation and in graft- versus-host disease (GVHD). For example, 
blockage of T cell function should result in reduced tissue destruction in tissue 

20 transplantation. Typically, in tissue transplants, rejection of the transplant is initiated 
through its recognition as foreign by T cells, followed by an immune reaction that destroys 
the transplant. The administration of a molecule which inhibits or blocks interaction of 
a 37 lymphocyte antigen with its natural ligand(5) on immune ceils (such as a soluble, 
monomeric form of a peptide having B7-2 activity alone or in conjunction with a 

2 5 monomeric form of a peptide having an activity of another B lymphocyte antigen {e.g., B7- 

1, B7-3) or blocking antibody), prior to transplantation can lead to the binding of the 
molecule to the natural ligand(s) on the immune cells without transmitting the 
corresponding costimulatory signal. Blocking B lymphocyte antigen function in this 
matter prevents cytokine synthesis by immune cells, such as T cells, and thus acts as an 

3 0 immunosuppressant. Moreover, the lack of costimulation may also be sufficient to 

anergize.the T cells, thereby inducing tolerance in a subject. Induction of long-term 
tolereince by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated 
administration of these blocking reagents. To achieve sufficient immunosuppression or 
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tolerance in a subject, it may also be necessary to block the function of a combination of 
B lymphocyte antigens. 

The efficacy of particular blocking reagents in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
5 humans. Examples of appropriate systems which can be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been 
used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as 
described in Lenschow et al. Science 257:789-792 (1992) and Turka et al„ Proc Natl. Acad. 
Sci USA, §9:11102-11105 (1992). In addition, murine models of GVHD (see Paul ed., 
10 Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to 
determine the effect of blocking B lymphocyte anhgen function in vivo on the development 
of that disease. 

Blocking antigen function may also be therapeutically useful for treahng 
autoimmune diseases. Many autoimmune disorders are the result of inappropriate 

1 5 activahon of T cells that are reactive against self tissue and which promote the production 
of cytokines and autoantibodies involved in the pathology of the diseases. Preventing the 
activation of autoreactive T cells may reduce or eliminate disease symptoms. 
Administration of reagents which block costimulation of T cells by disrupting 
receptonligand interactions of B lymphocyte antigens can be used to inhibit T cell 

2 0 activation and prevent production of autoantibodies or T cell-derived cytokines which 
may be involved in the disease process. Additionally, blocking reagents may induce 
antigen-spedfic tolerance of autoreactive T cells which could lead to long-term relief from 
the disease. The efficacy of blocking reagents in preventing or alleviating autoimmune 
disorders can be determined using a number of well-characterized animal models of 

2 5 human autoimmune diseases. Examples include murine experimental autoimmune 

encephalitis, systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, 
murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB rats, and 
murine experimental myastherua gravis (see Paul ed.. Fundamental Immunology, Raven 
Press, New York, 1989, pp. 840-856). 

3 0 Upregulation of an antigen function (preferably a B lymphocyte antigen function), 

as a means of up regulating immune responses, may also be useful irt therapy. 
Upregulation of immune responses may be in the form of enhancing an existing immune 
response or eliciting an initial immune response. For example, enhancing an immune 
response through stimulating B lymphocyte antigen function may be useful in cases of 
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viral infection. In addition, systemic v^iral diseases such as influenza, the common cold, 
and encephalitis might be alleviated by the administration of stimulatory forms of B 
lymphocyte antigens systemically. 

4 

Altemahvely, anti-viral immune responses may be enhanced in an infected patient 
5 by removing T cells from the patient, costimulating the T cells in vitro with viral antigen- 
pulsed APCs either expressing a peptide of the present invention or together with a 
stimulatory form of a soluble peptide of the present invention and reintroducing the in 
vitro activated T cells into the patient. Another method of enhancing anti-viral immune 
responses would be to isolate infected cells from a patient, transfect them with a nucleic 

1 0 acid encoding a protein of the present invention as described herein such that the cells 
express all or a portion of the protein on their surface, and reintroduce the transfected 
cells into the patient. The infected cells would now be capable of delivering a 
costimulatory signal to, and thereby activate, T cells in vivo. 

In another application, up regulation or enhancement of antigen function 

15 (preferably B lymphocyte anrigen function) may be useful in the induction of tumor 
immuruty. Tumor cells {e.g., sarcoma, melanoma, lymphoma, leukemia, neuroblastoma, 
carcinoma) transfected with a nucleic acid encoding at least one peptide of the present 
invention can be administered to a subject to overcome tumor-specific tolerance in the 
subject. If desired, the tumor cell can be transfected to express a combination of peptides. 

2 0 For example, tumor cells obtained from a patient can be transfected ex vivo with an 
expression vector directing the expression of a peptide having B7-2-like activity alone, or 
in conjunction with a peptide having B7-l-like activity and/or B7-3-like activity. The 
transfected tumor cells are returned to the patient to result in expression of the peptides 
on the surface of the transfected cell. Alternatively, gene therapy techniques can be used 

2 5 to target a tumor cell for transfection in vivo. 

The presence of the peptide of the present invention having the activity of a B 
lymphocyte antigen(s) on the surface of the tumor ceU provides the necessary 
costimulation signal to T cells to induce a T cell mediated immune response against the 
transfected tumor cells. In addition, tumor cells which lack MHC class I or MHC class II 

3 0 molecules, or which fail to reexpress sufficient amounts of MHC class I or MHC class II 

molecules, can be transfected with nucleic acid encoding all or a portion of {e,g., a 
cytoplasmic-domain truncated portion) of an MHC class I a chain protein and 
microglobulin protein or an MHC class II a chain protein and an MHC class 11 p chain 
protein to thereby express MHC class I or MHC class II proteins on the cell surface. 
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Expression of the appropriate class I or class II MHC in conjunction with a peptide having 
the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T cell mediated 
immune response against the transfected tumor cell. Optionally, a gene encoding an 
antisense construct which blocks expression of an MHC class II associated protein, such 
5 as the invariant chain, can also be cotransfected with a DNA encoding a peptide having 
the activity of a B lymphocyte antigen to promote presentation of tumor associated 
antigens and induce tumor specific immunity. Thus, the induction of a T ceD mediated 
immune response in a human subject may be sufficient to overcome tumor-specific 
tolerance in the subject. 

1 0 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without 
limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. 
Kruisbeek, D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates 

1 5 and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1- 
3.19; Chapter 7, Inrununologic studies in Humans); Herrmann et aL, Proc. Natl. Acad. Sci. 
USA 78:2488-2492, 1981; Herrmann et aL, J. Immunol. 128:1968-1974, 1982; Handa et aL, 
J. Immunol. 135:1564:-1572, 1985; Takai et aL, J. Immunol. 137:3494-3500, 1986; Takai et al., 
J. Immunol. 140:508-512, 1988; Herrmann et aL, Proc. NatL Acad. Sci. USA 78:2488-2492, 

2 0 1981; Herrmaim et aL, J. ImmianoL 128:1968-1974, 1982; Handa et aL, J. Immunol. 

135:1564-1572, 1985; Takai et aL, J. Immunol. 137:3494r-3500, 1986; Bowmanet aL, J. 
Virology 61:1992-1998; Takai et aL, J. Immunol. 140:508-512, 1988; Bertagnolli et aL, 
Cellular Immunology 133:327-341, 1991; Brown et aL, J. Immunol. 153:3079-3092, 1994. 
Assays for T-cell-dependent immunoglobulin responses and isotype switching 
25 (which will identify, among others, proteins that modulate T-cell dependent antibody 
responses and that affect Thl/Th2 profiles) include, without limitation, those described 
in: Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro 
antibody production, Mond, J.J. and Brunswick, M. In Current Protocols in Immunology. 
J.E.e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

3 0 Ivlixed lymphocyte reaction (MLR) assays (which will identify, among others, 

protieins that generate predominantly Thl and CTL responses) include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, 
D.H. Mcirgulies, E.M^ Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 
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1, Immunologic studies in Humans); Takai et aL, J. Immunol. 137:3494-3500, 1986; Takai 
et al., J. Immunol. 140:508-512, 1988; BertagnoUi et al., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins 
expressed by dendritic cells that activate naive T-cells) include, without limitation, those 
5 described in: Guery et al, J. Immunol. 134:536-544, 1995; Inaba et ah. Journal of 
Experimental Medicine 173:549-559, 1991; Macatonia et al.. Journal of Immunology 
154:5071-5079, 1995; Porgador et al.. Journal of Experimental Medicine 182:255,-260, 1995; 
Nair et al. Journal of Virology 67:4062-4069, 1993; Huang et al.. Science 264:961-965, 
1994; Macatonia et al.. Journal of Experimental Medicine 169:1255-1264, 1989; Bhardwaj 

10 et al.. Journal of Clinical InvesHgation 94:797-807, 1994; and Inaba et al.. Journal of 
Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/ apoptosis (which will identify, among others, 
proteins that prevent apoptosis after superantigen induchon and proteins that regulate 
lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz 

15 et al.. Cytometry 13:795-808, 1992; Gorczyca et al.. Leukemia 7:659-670, 1993; Gorczyca et 
al.. Cancer Research 53:1945-1951, 1993; Itch et al., CeU 66:233-243, 1991; Zacharchuk, 
Journal of Immunology 145:4037-4045, 1990; Zamai et al.. Cytometry 14:891-897, 1993; 
Gorczyca et ai.. International Journal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and 

2 0 development include, without limitahon, those described in: Andca et al.. Blood 
84:111-117, 1994; Fine et al.. Cellular Immunology 155:111-122, 1994; Galy et al.. Blood 
85:2770-2778, 1995; Toki et al., Proc. Nat Acad Sci. USA 88:7548-7551, 1991. 

Hematopoiesis Regulahng Activity 

25 A protein of the present invention may be useful in regulation of hematopoiesis 

and, consequently, in the treatment of myeloid or lymphoid cell deficiencies. Even 
marginal biological activity in support of colony forming cells or of factor-dependent cell 
lines indicates involvement in regulating hematopoiesis, e.g. in supporting the growth and 
proliferation of erythroid progenitor cells alone or in combination with other cytokines, 

,30 thereby indicating utility, for example, in treating various anemias or for use in 
conjunction with irradiation/chemotherapy to stimulate the production of erythroid 
precursors and /or erythroid cells; in supporting the growth and proliferation of myeloid 
cells such as granulocytes and monocytes/ macrophages (i.e.> traditional CSF activity) 
useful, for example, in conjunction with chemotherapy to prevent or treat consequent 
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myelo-suppression; in supporting the growth and proliferation of megakaryocytes and 
consequently of platelets thereby allowing prevention or treatment of various platelet 
disorders such as thrombocytopenia, and generally for use in place of or complimentary 
to platelet transfusions; and/or in supporting the growth and proliferahon of 
5 hematopoietic stem cells which are capable of maturing to any and all of the above- 
mentioned hematopoiehc cells and therefore find therapeutic utility in various stem ceil 
disorders (such as those usually treated with transplantation, including, without 
limitation, aplastic anemia and paroxysmal nocturnal hemoglobinuria), as well as in 
repopulating the stem cell compartment post irradiation/ chemotherapy, either in-vivo or 

10 ex-vivo (i.e., in conjunction with bone marrow transplantation or with peripheral 
progenitor cell transplantation (homologous or heterologous)) as normal cells or 
genetically manipulated for gene therapy. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

15 Suitable assays for proliferation and differentiation of various hematopoietic lines 

are cited above. 

Assays for embryoruc stem cell differentiahon (which will identify, among others, 
proteins that influence embryonic differentiation hematopoiesis) include, without 
limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et 
2 0 al.. Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al.. Blood 
81:2903-2915, 1993. 

Assays for stem cell survival and differentiation (which will identify, among 
others, proteins that regulate lympho-hematopoiesis) include, without limitation, those 
described in: Methylcellulose colony forming assays, Freshney, M.G. In Culture of 

2 5 Hematopoietic Cells. R.I. Freshney, et al eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, 

NY. 1994; Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-5911, 1992; Primitive 
hematopoietic colony forming cells with high proliferative potential, McNiece, I.K. and 
Briddell, R.A. In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 23-39, 
Wiley-Liss, Inc, New York, NY. 1994; Neben et al.. Experimental Hematology 22:353-359, 

3 0 1994; Cobblestone area forming cell assay, Ploemacher, R.E. In Culture of Hematopoietic 

Cells. R.L Freshney, et al. eds. Vol pp. 1-21, Wiley-Liss, Inc., New York, NY. 1994; Long 
term bone marrow cultures in the presence of stromal cells, Spooncer, E., Dexter, M. and 
Allen, T. In Culture of Herrmtopoietic Cells. R-L Freshney, et aL eds. Vol pp. 163-179, 
Wiley-Liss, Inc., New York, NY. 1994; Long term culture initiating cell assay, Sutherland, 
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H.J. In Culture of Hematopoietic Cells, R.I. Freshney, et al eds. Vol pp. 139-162, Wiley-Liss, 
Inc., New York, NY. 1994. 

Tissue Growth Activity 

A protein of the present invention also may have utility in compositions used for 
bone, cartilage, tendon, ligament and /or nerve tissue growth or regeneration, as well as 
for wound healing and tissue repair and replacement, and in the treatment of bums, 
incisions and ulcers. 

A protein of the present invention, which induces cartilage and /or bone growth 
in circumstances where bone is not normally formed, has application in the healing of 
bone fractures and cartilage damage or defects in humans and other animals. Such a 
preparation employing a protein of the invenrion may have prophylactic use in closed as 
well as open fracture reduction and also in the improved fixation of artificial joints. De 
novo bone formation induced by an osteogenic agent contributes to the repair of 
congenital, trauma induced, or oncologic resection induced craniofacial defects, and also 
is useful in cosmetic plastic surgery. 

A protein of this invention may also be used in the treatment of periodontal 
disease, and in other tooth repair processes. Such agents may provide an environment 
to attract bone-forming cells, stimulate growth of bone-forming ceils or induce 
differentiation of progenitors of bone-forming cells. A protein of the invention may also 
be useful in the treatment of osteoporosis or osteoarthritis, such as through stimulation 
of bone and /or carhlage repair or by blocking inflammation or processes of tissue 
destruction (coilagenase activity, osteoclast activity, etc.) mediated by inflammatory 
processes. 

Another category of tissue regeneration activity that may be attributable to the 
protein of the present invention is tendon/ ligament formation. A protein of the present 
invention, which induces tendon /ligament-like tissue or other tissue formation in 
circumstances where such tissue is not normally formed, has application in the healing of 
tendon or ligament tears, deformities and other tendon or ligament defects in humans and 
other animals. Such a preparation employing a tendon /ligament-like tissue inducing 
protein may have prophylactic use in preventing damage to tendon or ligament tissue, as 
well as use in the improved fixation of tendon or ligament to bone or other tissues, and 
in repairing defects to tendon or ligament tissue. De novo tendon /ligament-like tissue 
formation induced by a composition of the present invention contributes to the repair of 
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congenital, trauma induced, or other tendon or ligament defects of other origin, and is also 
useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments. The 
compositions of the present invention may provide an environment to attract tendon- or 
ligament-forming cells, stimulate growth of tendon- or ligament-forming ceils, induce 
5 differentiation of progenitors of tendon- or ligament-forming cells, or induce growth of 
tendon/ligament cells or progenitors ex vivo for return in vivo to effect hssue repair. The 
compositions of the invention may also be useful in the treatment of tendinitis, carpal 
tunnel syndrome and other tendon or ligament defects. The compositions may also 
include an appropriate matrix and /or sequestering agent as a carrier as is well known in 
1 0 the art. 

The protein of the present invention may also be useful for proliferahon of neural 
ceils and for regeneration of nerve and brain tissue, i.e. for the treatment of central and 
peripheral nervous system diseases and neuropathies, as well as mechanical and 
traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve 

15 tissue. More specifically, a protein may be used in the treatment of diseases of the 
peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and ' 
localized neuropathies, and central nervous system diseases, such as Alzheimer's, 
Parkinson's disease, Himtington's disease, amyotrophic lateral sclerosis, and Shy-Drager 
syndrome. Further conditions which may be treated in accordance with the present 

2 0 invention include mechanical and traumatic disorders, such as spinal cord disorders, head 
trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting 
from chemotherapy or other medical therapies may also be treatable using a protein of the 
invention. 

Proteins of the invention may also be useful to promote better or faster closure of 

2 5 non-healing wounds, including without limitation pressure ulcers, ulcers associated with 

vascular insufficiency, surgical and traumatic wovmds, and the like. 

It is expected that a protein of the present invention may also exhibit activity for 
generation or regeneration of other tissues, such as organs (including, for example, 
pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) 

3 0 and vascular (including vascular endothelium) tissue, or for promoting the growth of cells 

corriprising such tissues. Part of the desired effects may be by ir^hibition or modulation 
of fibrotic scarring to allow normal tissue to regenerate. A protein of the invention may 
also exhibit angiogenic activity. 
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A protein of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, 
and conditions resulting from systemic cytokine damage. 

A protein of the present invention may also be useful for promoting or inhibiting 
differentiation of tissues described above from precursor tissues or cells; or for inhibiting 
the growth of tissues described above. 

The achvity of a protein of the invention may, among other means, be measured 
by the following methods:. 

Assays for hssue generation activity include, without hmitation, those described 
in: International Patent Publication No. W095/ 16035 (bone, cartilage, tendon); 
International Patent Publication No. WO95/05846 (nerve, neuronal); International Patent 
Publication No. WO91/07491 (skin, endothelium ). 

Assays for wound healing activity include, without limitation, those described in: 
Winter, Epidermal Wound Healing, pps. 71-112 (Maibach, HI and Rovee, DT, eds.). Year 
Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. 
Dermatol 71:382-84 (1978). 

Activin/Inhibin Activity 

A protein of the present invention may also exhibit activin- or inhibin- related 
activities. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to 
stimulate the release of follicle stimulating hormone (FSH). Thus, a protein of the present 
invention, alone or in heterodimers with a member of the inhibin a family, may be useful 
as a contraceptive based on the ability of inhibins to decrease fertility in female mammals 
and decrease spermatogenesis in male mammals. Administration of sufficient amounts 
of other inhibins can induce infertility in these mammals. Alternatively, the protein of the 
invention, as a homodimer or as a heterodimer with other protein subimits of the ii\hibin- 
P group, may be useful as a fertility inducing therapeutic, based upon the ability of activin 
molecules in stimulating FSH release from cells of the anterior pituitary. See, for example. 
United States Patent 4,798,885. A protein of the invention may also be useful for 
advancement of the onset of fertility in sexually immature maiiunals, so as to increase the 
lifetime reproductive performance of domestic animals such as cows, sheep and pigs. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 
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Assays for activin/inhibin activity include, without limitation, those described in: 
Vale et al., Endocrinology 91:562-572, 1972; Ling et al.. Nature 321:779-782, 1986; Vale et 
al.. Nature 321:776-779, 1986; Mason et al.. Nature 318:659-663, 1985; Forage et ah, Proc. 
Natl. Acad. Sci, USA 83:3091-3095, 1986. 

5 

Chemotactic/Chemokinetic Activity 

A protein of the present invention may have chemotactic or chemokinetic achvity 
(e.g., act as a chemokine) for mammailian cells, including, for example, monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. 
10 Chemotactic and chemokinetic proteins can be used to mobilize or attract a desired cell 
population to a desired site of action. Chemotactic or chemokinetic proteins provide 
particular advaintages in treatment of wounds and other trauma to hssues, as well as in 
. - treatment of localized infections. For example, attraction of lymphocytes, monocytes or 
neutrophils to tumors or sites of infechon may result in improved immxme responses 
15 against the tumor or infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it 
can stimulate, directly or indirectly, the directed orientation or movement of such cell 
population. Preferably, the protein or peptide has the ability to directly stimulate directed 
movement of cells. Whether a particular protein has chemotactic activity for a population 
2 0 of cells can be readily determined by employing such protein or peptide in any known 
assay for cell chemotaxis. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Assays for chemotactic activity (which will identify proteins that induce or prevent 

2 5 chemotaxis) consist of assays that measure the ability of a protein to induce the migration 

of cells across a membrane as well as the ability of a protein to induce the adhesion of one 
cell population to another cell -population. Suitable assays for movement and adhesion 
include, without limitation, those described in: Current Protocols in Immunology, Ed by 
J.E. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W.Strober, Pub. Greene 

3 0 Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and 

beta Chemokines 6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. 
APMIS 103:140-146, 1995; MuUer et al Eur. J. Immunol. 25: 1744-1748; Gruber et al. J. of 
Immunol. 152:5860-5867, 1994; Johnston et al. ]. of Immunol. 153: 1762-1768, 1994. 
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Hemostatic and Thrombolytic Activity 

A protein of the invention may also exhibit hemostatic or thrombolytic activity. 
As a result, such a protein is expected to be useful in treatment of various coagulation 
disorders (including hereditary disorders, such as hemophilias) or to enhance coagulation 
5 and other hemostatic events in treating wounds resulting from trauma, surgery or other 
causes. A protein of the invention may also be useful for dissolving or irU^ibiting 
formation of thromboses and for treatment and prevention of conditions resulting 
therefrom (such as, for example, infarction of cardiac and central nervous system vessels 
(e.g., stroke). 

1 0 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Assay for hemostatic and thrombolytic activity include, without limitation, those 
described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al.. Thrombosis 
Res. 45:413-419, 1987; Humphrey et al.. Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 

15 35:467-474,1988. 

Receptor /Ligand Activity 

A protein of the present invention may also demonstrate activity as receptors, 
receptor ligands or inhibitors or agonists of receptor /ligand interactions. Examples of 
2 0 such receptors and ligands include, without limitation, cytokine receptors and their 
ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, 
receptors involved in cell-cell interactions and their ligands (including without limitation, 
cellular adhesion molecules (such as selectins, integrins and their ligands) and 
receptor /ligand pairs involved in antigen presentation, antigen recognition and 

2 5 development of cellular and humoral immune responses). Receptors and ligands are also 

useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/ ligand interaction. A protein of the present invention (including, without 
limitation, fragments of receptors and ligands) may themselves be useful as inhibitors of 
receptor /ligand interactions. 

3 0 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Suitable assays for receptor-ligand activity include without limitation those 
described iniCurrent Protocols in Immimology, Ed by J.E. Coligan, A.M. Kruisbeek, D.H. 
Margulies, E^. Shevach, W.Strober, Pub. Greene Publishing Associates and 
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Wiley-Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static 
conditions 7.28.1-7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; 
Bierer et al., J. Exp. Med. 168:1145-1156, 1988; Rosenstein et ah, J. Exp. Med. 169:149-160 
1989; Stoltenborg et al., J. Iirununoi. Methods 175:59-68, 1994; Stitt et al., CeU 80:661-670, 
5 1995. 

Anti-Inflammatory Activity 

Proteins of the present invention may also exhibit anti-inflammatory activity. The 
anh-inflammatory activity may be achieved by providing a stimulus to cells involved in 

10 the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for 
example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the 
inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or 
suppressing production of other factors which more directly inhibit or promote an 
inflammatory response. Proteins exhibiting such activities can be used to treat 

15 inflammatory conditions including chronic or acute conditions), including without 
limitation inflammation associated with infection (such as septic shock, sepsis or systemic 
inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin 
lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting 

2 0 from over production of cytokines such as TTNF or IL-1 . Proteins of the invention may also 
be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 

Cadherin /Tumor Invasion Suppressor Activity 

Cadherins are ccdcium-dependent adhesion molecules that appear to play major 

2 5 roles during development, particularly in defining specific cell types. Loss or alteration 

of normal cadherin expression can lead to changes in cell adhesion properties linked to 
tumor growth and metastasis. Cadherin malfunction is also implicated in other human 
diseases, such as pemphigxis vulgaris and pemphigus foliaceus (auto-immune blistering 
skin diseases), Crohn's disease, and some developmental abnormalities. 

3 0 The cadherin superfamily includes well over forty members, each with a distinct 

pattern of expression. All members of the superfamily have in common conserved 
extracellular repeats (cadherin domains), but structural differences are found in other 
parts of the molecule. The cadherin domains bind calcium to form their tertiary structure 
and thus calcium is required to mediate their adhesion. Only a few amino acids in the 
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first cadherin domain provide the basis for homophilic adhesion; modification of this 
recognition site can change the specificity of a cadherin so that instead of recognizing only 
itself, the mutant molecule can now also bind to a different cadherin. In addition, some 
cadherins engage in heterophilic adhesion with other cadherins. 
5 E-cadherin, one member of the cadherin superfamily, is expressed in epithelial cell 

types. Pathologically, if E-cadherin expression is lost in a tumor, the malignant cells 
become invasive and the cancer metastasizes. Transfection of cancer cell lines with 
polynucleotides expressing E-cadherin has reversed cancer-associated changes by 
returning altered cell shapes to normal, restoring cells' adhesiveness to each other and to 

10 their substrate, decreasing the cell growth rate, and drastically reducing anchorage- 
independent cell growth. Thus, reintroducing E-cadherin expression reverts carcinomas 
to a less advanced stage. It is likely that other cadherins have the same invasion 
suppressor role in carcinomas derived from other tissue types. Therefore, proteins of the 
present invention with cadherin activity, and polynucleotides of the present invention 

15 encoding such proteins, can be used to treat cancer. Introducing such proteins or 
polynucleotides into cancer cells can reduce or eliminate the cancerous changes observed 
in these cells by providing normal cadherin expression. 

Cancer cells have also been shown to express cadherins of a different tissue type 
than their origin, thus allowing these cells to invade and metastasize in a different tissue 

2 0 in the body. Proteins of the present invention with cadherin activity, and polynucleotides 

of the present invention encoding such proteins, can be substituted in these cells for the 
inappropriately expressed cadherins, restoring normal cell adhesive properties and 
reducing or eliminating the tendency of the cells to metastasize. 

Additionally, proteii\s of the present invention with cadherin activity, and 
25 polynucleotides of, the present invention encoding such proteins, can used to generate 
antibodies recognizing and binding to cadherins. Such antibodies can be used to block 
the adhesion of inappropriately expressed tumor-cell cadherins, preventing the cells from 
forming a tumor elsewhere. Such ari anti-cadherin antibody can also be used as a marker 
for the grade, pathological type, and prognosis of a cancer, i.e. the more progressed the 

3 0 cancer, the less cadherin expression there will be, and this decrease in cadherin expression 

can be detected by the use of a cadherin-binding antibody. 

Fragments of proteins of the present invention with cadherin activity, preferably 
a polypeptide comprising a decapeptide of the cadherin recognition site, and poly- 
nucleotides of the present invention encoding such protein fragments, can also be used 
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to block cadherin function by binding to cadherins and preventing them from binding in 
ways that produce undesirable effects. Additionally, fragments of proteins of the present 
invention with cadherin activity, preferably truncated soluble cadherin fragments which 
have been found to be stable in the circulation of cancer patients, and polynucleotides 
5 encoding such protein fragments^ can be used to disturb proper cell-cell adhesion. 

Assays for cadherin adhesive and invasive suppressor activity include, without 
limitation, those described in: Hortsch et al. J Biol Chem 270 (32): 18809-18817, 1995; 
Miyaki et al. Oncogene 11: 2547-2552, 1995; Ozawa et al. Cell 63: 1033-1038, 1990. 

10 Tumor Inhibition Activity 

In addition to the activities described above for immunological treatment or 
prevention of tumors, a protein of the invention may exhibit other anti-tumor activities. 
A protein may inhibit tumor growth directly or indirectly (such as, for example, via 
ADCC). A protein may exhibit its tumor inhibitory activity by acting on tumor tissue or 

1 5 tumor precursor tissue, by inhibiting formation of tissues necessary to support tumor 
growth (such as, for exarhple, by iiUiibiting angiogenesis), by causing production of other 
factors, agents or cell types which inhibit tumor growth, or by suppressing, eliminating 
or inhibiting factors, agents or cell types which promote tumor growth. 

2 0 Other Activities 

A protein of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, irifection or function of, or killing, infectious 
agents, including, without limitation, bacteria, viruses, fungi and other parasites; effecting 
(suppressing or enhancing) bodily characteristics, including, without limitation, height, 

2 5 weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or organ 

or body part size or shape (such as, for example, breast augmentation or diminution, 
change in bone form or shape); effecting biorhythms or caricadic cycles or rhythms; 
effecting the fertility of male or female subjects; effecting the metabolism, catabolism, 
anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein, 

3 0 carbohydrate, vitamins, minerals, cofactors or other nutritional factors or component(s); 

effecting behavioral characteristics, including, without limitation, appetite, libido, stress, 
cogiution (including cognitive disorders), depression (including depressive disorders) and 
violent behaviors; providing analgesic effects or other pain reducing effects; promoting 
differentiation and growth of embryonic stem cells in lineages other than hematopoietic 
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lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies 
of the enzyme and treating deficiency-related diseases; treatment of hyperproliferative 
disorders (such as, for example, psoriasis); immunogiobuUn-like activity (such as, for 
example, the ability to bind antigens or complement); and the ability to act as an antigen 
5 in a vaccine composition to raise an immune response against such protein or another 
material or entity which is cross-reactive with such protein. 



ADMINISTRATION AND DOSING 

10 A protein of the present invention (from whatever source derived, including 

without limitation from recombinant and non-recombinant sources) may be used in a 
pharmaceutical composition when combined with a pharmaceutically acceptable carrier. 
Such a composition may also contain (in addition to protein and a carrier) diluents, fillers, 
salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 

15 "pharmaceutically acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active LngTedient(s). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of 
the invention may also contain cytokines, lymphokines, or other hematopoietic factors 
such as M-CSF, GM-CSF, TNP, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-S, IL-9, IL-10, IL-11, 

2 0 IL-12, IL-13, IL-14, IL-15, IFN, TNFO, TNFl, TNF2, G-CSF. Meg-CSF, thrombopoietin, stem 

cell factor, and erythropoietin. The pharmaceutical composition may further contain other 
agents which either enhance the activity of the protein or compliment its activity or use 
in treatment. Such additional factors and/or agents may be included in the 
pharmaceutical composition to produce a synergistic effect with protein of the invention, 
25 or to minimize side effects. Conversely, protein of the present invention may be included 
in formulations of the particular cytokine, lymphokine, other hematopoietic factor, 
thrombolytic or anti-thrombotic factor, or anti-inflammatory agent to minimize side effects 
of the cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic 
factor, or anti-inflammatory agent. 

3 0 A protein of the present invention may-be active in multimers (e.g., heterodimers 

or homodimers) or complexes with itself or other proteins. As a result, pharmaceutical 
compositions of the invention may comprise a protein of the invention in such multimeric 
or complexed form. 
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The pharmaceutical composition of the invention may be in the form of a complex 
of the protein(s) of present invention along with protein or peptide anhgens. The protein 
and/or peptide antigen will deliver a stimulatory signal to both B and T lymphocytes. B 
lymphocytes will respond to antigen through their surface immunoglobulin receptor. T 
5 lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the anhgen by MHC proteins. MHC and structurally related proteins 
including those encoded by class I and class II MHC genes on host cells will serve to 
present the peptide antigen(s) to T lymphocytes. The antigen components could also be 
supplied as purified MHC-peptide complexes alone or with co-stimulatory molecules that 
10 can directly signal T cells. Alternatively antibodies able to bind surface immunolgobulin 
and other molecules on B cells as well as antibodies able to bind the TCR and other 
molecules on T cells can be combined with the pharmaceutical composition of the 
invention. 

The pharmaceutical composition of the invention may be in the form of a liposome 
15 in which protein of the present invention is combined, in addition to other 
pharmaceutically acceptable carriers, with amphipathic agents such as lipids which exist 
in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers 
in aqueous solution. Suitable lipids for liposomal formulation include, without limitation, 
monoglycerides, diglycerides, sulfatides, lysolecithin, phospholipids, saponin, bile acids, 
2 0 and the like. Preparation of such liposomal formulations Ls within the level of skill in the 
art, as disclosed, for example, in U.S. Patent No. 4,235,871; U.S. Patent No. 4,501,728; U.S. 
Patent No. 4,837,028; and U.S. Patent No. 4,737,323, all of which are incorporated herein 
by reference. 

As used herein, the term "therapeutically effective amount" mear\s the total 

2 5 amount of each active component of the pharmaceutical composition or method that is 

sufficient to show a meaningful patient benefit, i.e., treatment, healing, prevention or 
amelioration of the releveint medical condition, or an increase in rate of treatment, healing, 
prevention or amelioration of such conditions. When applied to an individual active 
ingredient, administered alone, the term refers to that ingredient alone. When applied to 

3 0 a combination, the term refers to combined amounts of the active ingredients that result 

in the therapeutic effect, whether administered in combination, serially or simultaneously. - 
In practicing the method of treatment or use of the present invention, a 
therapeutically effective amount of protein of the present invention is administered to a 
mammal having a condition to be treated. Protein of the present invention may be 
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administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, lymphokines 
or other hematopoietic factors. When co-administered with one or more cytokines, 
lymphokines or other hematopoietic factors, protein of the present invention may be 
5 administered either simultaneously with the cytokine(s), lymphokine(s), other 
hematopoietic factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If 
administered sequentially, the attending physician will decide on the appropriate 
sequence of administering protein of the present invention in combination with 
cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolyhc or anti-thrombotic 
10 factors. 

Administration of protein of the present invention used in the pharmaceutical 
composition or to practice the method of the present invention can be carried out in a 
variety of conventional ways, such as oral ingestion, inhalation, topical application or 
cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. 

1 5 Intravenous administration to the patient is preferred. 

When a therapeutically effective amount of protein of the present invention is 
administered orally^ protein of the present invention will be in the form of a tablet, 
capsule, powder, solution or elixir. When administered in tablet form, the pharmaceutical 
composition of the invention may additionally contain a solid carrier such as a gelatin or 

2 0 an adjuvant. The tablet, capsule, and powder contain from about 5 to 957o protein of the 
present invention, and preferably from about 25 to 90% protein of the present invention. 
When administered in liquid form, a liquid carrier such as water, petroleum, oils of animal 
or plant origin such as peanut oil, mineral oil, soybean oil, or sesame oil, or synthetic oils 
may be added. The liquid form of the pharmaceutical composition may further contain 

2 5 physiological saline solution, dextrose or other saccharide solution, or glycols such as 

ethylene glycol, propylene glycol or polyethylene glycol. When administered in liquid 
form, the pharmaceutical composition contains from about 05 to 90% by weight of protein 
of the present invention, and preferably from about 1 to 50% protein of the present 
invention, 

3 0 When a therapeutically effective amount of protein of the present invention is 

administered by intravenous, cutaneous or subcutaneous injection, protein of the present 
invention will be in the form of a pyrogen-free, parenterally acceptable aqueous solution. 
The preparation of such parenterally acceptable protein solutions^ having due regard to 
pH, isotonicity, stability, and the like, is within the skill in the art. A preferred 
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pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection should 
contain, in addition to protein of the present invention, an isotonic vehicle such as Sodium 
Chloride Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride 
Injection, Lactated Ringer's Injection, or other vehicle as known in the art. The 
pharmaceutical composition of the present invention may also contain stabilizers, 
preservatives, buffers, antioxidants, or other additives known to those of skill in the art. 

The amount of protein of the present invention in the pharmaceutical composition 
of the present invention will depend upon the nature and severity of the condition being 
treated, and on the nature of prior treatments which the patient has undergone. 
Ultimately, the attending physician will decide the amount of protein of the present 
invention with which to treat each individual patient Initially, the attending physician 
will administer low doses of protein of the present invention and observe the patient's 
response. Larger doses of protein of the present invention may be administered until the 
optimal therapeutic effect is obtained for the patient, and at that point the dosage is not 
increased further. It is contemplated that the various pharmaceutical compositions used 
to practice the method of the present invention should contain about 0.01 yxg to about 100 
mg (preferably about O.lng to about 10 mg, more preferably about 0.1 ^g to about 1 mg) 
of protein of the present invention per kg body weight. 

The duration of intravenous therapy using the pharmaceutical composition of the 
present invention will vary, depending on the severity of the disease being treated and 
the condition and potential idiosyncratic response of each individual patient. It is 
contemplated that the duration of each application of the protein of the present invention 
will be in the range of 12 to 24 hours of continuous intravenous administration. 
Ultimately the attending physician will decide on the appropriate duration of intravenous 
therapy using the pharmaceutical composition of the present invention. 

Protein of the invention may also be used to immunize animals to obtain 
polyclonal and monoclonal antibodies which specifically react with the protein. Such 
antibodies may be obtained using either the entire protein or fragments thereof as an 
immunogen. The peptide immunogens addihonally may contain a cysteine residue at the 
carboxyl terminus, and are conjugated to a hapten such as keyhole limpet hemocyanin 
(KLH). Methods for synthesizing such peptides are known in the art, for example, as in 
R.P. Merrifield, J. Amer.Chem.Soc. 85, 2149-2154 (1963); J.L. Krstenansky, et al, FEBS Lett. 
211, 10 (1987). Monoclonal antibodies binding to the protein of the invention may be 
useful diagnostic agents for the immunodetection of the protein. Neutralizing monoclonal 
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antibodies binding to the protein may also be useful therapeutics for both conditions 
associated with the protein and also in the treatment of some forms of cancer where 
abnormal expression of the protein is involved. In the case of cancerous cells or leukemic 
cells, neutralizing monoclonal antibodies against the protein may be useful in detecting 
5 and preventing the metastatic spread of the cancerous cells, which may be mediated by 
the protein. 

For compositions of the present invention which are useful for bone, cartilage, 
tendon or ligament regeneration, the therapeutic method includes administering the 
composition topically, systematically, or locally as an implant or device. When 

10 administered, the therapeutic composition for use in this invention is, of course, in a 
pyrogen-free, physiologically acceptable form. Further, the composition may desirably 
be encapsulated or injected in a viscous form for delivery to the site of bone, cartilage or 
tissue damage. Topical administration may be suitable for wound healing and tissue 
repair. Therapeutically useful agents other than a protein of the invention which may also 

15 optionally be included in the composition as described above, may alternatively or 
additionally, be administered simultaneously or sequentially with the composition in the 
methods of the invention. Preferably for bone and/or cartilage formation, the 
composition would include a matrix capable of delivering the protein-containing 
composition to the site of bone and /or cartilage damage, providing a structure for the 

2 0 developing bone and cartilage and optimally capable of being resorbed into the body. 
Such matrices may be formed of materials presently in use for other implanted medical 
applications. 

The choice of matrix material is based on biocompatibility, biodegradability, 
mechanical properties, cosmetic appearance and interface properties. The particular 

2 5 application of the compositions will define the appropriate formulation. Potential 

matrices for the compositions may be biodegradable and chemically defined calcium 
sulfate, tricalciumphosphate, hydroxyapatite, polylactic acid, polyglycolic acid and 
polyanhydrides. Other potential materials are biodegradable and biologically well- 
defined, such as bone or dermal collagen. Further matrices are comprised of pure proteins 

3 0 or extracellular matrix components. Other potential matrices are nonbiodegradable and 

chemically defined, such as sintered hydroxapatite, bioglass, aluminates, or other 
ceramics. Matrices may be comprised of combinations of any of the above mentioned 
types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalciumphosphate. The bioceramics may be altered in composition, such as in calcium- 



67 



wo 98/55614 



PCT/US98/11210 



aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. 

Presently preferred is a 50:50 (mole weight) copoiymer of lactic acid and glycolic 
acid in the form of porous particles having diameters ranging from 150 to 800 microns. 
5 In some applications, it will be useful to utilize a sequestering agent, such as 
carboxymethyl cellulose or autologous blood clot to prevent the protein compositions 
from disassociating from the matrix. 

A preferred family of sequestering agents is celiulosic materials such as 
alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, 

1 0 ethylcellulose, hydroxyethylceilulose, hydroxypropylcellulose, hydroxypropyl- 
methylcellulose, and carboxymethylcellulose, the most preferred being cationic salts of 
carboxymethylcellulose (CMC). Other preferred sequestering agents include hyaluronic 
acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl 
polymer and poly (vinyl alcohol). The amount of sequestering agent useful herein is 0.5-20 

15 wt%, preferably 1-10 wt% based on total formulation weight, which represents the 
amount necessary to prevent desorbtion of the protein from the polymer matrix and to 
provide appropriate handling of the composition, yet not so much that the progenitor cells 
are prevented from infiltrating the matrix, thereby providing the protein the opportunity 
to assist the osteogenic activity of the progenitor cells. 

2 0 In further compositions, proteins of the invention may be combined with other 

agents beneficial to the treatment of the bone and /or cartilage defect, wound, or tissue in 
question. These agents include various growth factors such as epidermal growth factor 
(EGF), platelet derived growth factor (PDGF), transforming growth factors (TGF-a and 
TGF-p), and insuiin-like growth factor (IGF). 
25 The therapeutic compositions are also presently valuable for veterinary 

applications. Particularly domestic animals and thoroughbred horses, in addition to 
humans, are desired patients for such treatment with proteins of the present invention. 

The dosage regimen of a protein-containing pharmaceurtcal composition to be 
used in tissue regeneration will be determined by the attending physician considering 

3 0 various factors which modify the action of the proteins, e.g., amoimt of tissue weight 

desired to be formed, the site of damage, the condition of the damaged tissue, the size of 
a wound, type of damaged tissue (e.g., bone), the patient's age, sex, and diet, the severity 
of any irifection, time of admirustration and other clinical factors. The dosage may vary 
with the type of matrix used in the reconstitution and with inclusion of other proteins in 
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the pharmaceutical composition. For example, the addition of other known growth 
factors, such as IGF I (insulin like growth factor 1), to the final composition, may also effect 
the dosage. Progress can be monitored by periodic assessment of tissue/bone growth 
and /or repair, for example. X-rays, histomorphometric determinations and tetracycline 
labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other 
known methods for introduction of nucleic acid into a cell or organism (including, without 
limitation, in the form of viral vectors or naked DMA). 

Cells may also be cultured ex vivo in the presence of proteins of the present 
invenhon in order to proliferate or to produce a desired effect on or activity in such cells. 
Treated cells can then be introduced in vivo for therapeutic purposes. 

Patent and literature references cited herein are incorporated by reference as if 
fully set forth. 
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SEQUENCE LISTING 



15 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: Jacobs, Kenneth 
McCoy, John M. 
LaValiie, Edward R. 
Racie, Lisa A. 
10 Treacy, Maurice 

Spaulding. Vikki 
Agostino, Michael J. 
Howes , Steven H. 
Fechtel , Kim 

(ii) TITLE OF INVENTION: SECRETED PROTEINS AND POLYNUCLEOTIDES 
ENCODING THEM 

(iii) NUMBER OF SEQUENCES: 3 2 

20 

Uv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genetics Institute. Inc. 
(3) STREET: 37 Cambr idgePark Drive 

(C) CITY: Caaibridge 

2 5 ■ (D) STATE: MA . 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02140 

(v) COMPUTER READABLE FORM: ' 

3 0 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: lEM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

3 5 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

{ C ) CLASSIFICATION : 

4 0 (viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Sprunger, Suzanne A. 
(3) REGISTRATION NUMBER: 41,3 23 

(ix) TELECOMMUNICATION INFORMATION: 
45 (.^) TELEPHONE: (617) 498-8284 

(B) TELEFAX: (617) 876-5851 



(2) INFOR>lATION FOR SEQ ID NO : 1 : 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1755 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
55 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CAGTGGAGTC TGTACTGGCT GCGGGGGACC CTGCTCATTT GAAAATCTGA CATCAGCTGG 50 

10 GCAGTCGCCC CCCTCCTCCT TTCCTCCCTC TACTCTGACA CAGCACTTAG CACCTGAATC 120 

TTCGTTTCTC TCCCAGGGAC CCTCCATTTT CCATATCCAG GAA.aATGTGA TGCGCCACAG 180 

GTATCAGCGT CTGGATCGCC ACTTCACGTT TTAGCCACAA GTGACTCAGT GGAAGATCCA 240 

15 

GAGTCAACAG AGGCTCGTCA GGAAGATGTC TACAGAAAAG GTAGACCAAA AGGAGGAAGC 300 

TGGGGAAAAA GAGGTGTGCG GAGACCAGAT CAARGGACCG GACAAAGAGG AGGAACCACC 360 

2 0 AGCTGCTGCA TCCCATGGCC AGGGGTGGCG TCCAGGTGGC AGAGCAGCTA GGAACGCAAG 42 0 

GCCTGAACCT GGGGCCAGAC ACCCTGCTCT CCCGGCCATG GTCAACGACC CTCCAGTACC 48 0 

TGCCTTACTG TGGGCCCAGG AGGTGGGCCA AGTCTTGGCA. GGCCGTGCCC GCAGGCTGCT 54 0 
25 . 

GCTGCAGTTT GGGGTGCTCT TCTGCACCAT CCTCCTTTTG CTCTGGGTGT CTGTCTTCCT 60 0 

CTATGGCTCC TTCTACTATT CCTATATGCC GACAGTCAGC CACCTCAGCC CTGTGCATTT 66 0 

3 0 CTACTACAGG ACCGAGTGTG ATTCCTCCAC CACCTCACTC TGCTCCTTCC CTGTTGCCAA 72 0 

TGTCTCGCTG ACTAAGGGTG GACGTGATCG GGTGCTGATG TATGGACAGC CGTATCGTGT 7 80 

TACCTTAGAG CTTGAGCTGC CAGAGTCCCC TGTGAATCAA GATTTGGGCA TGTTOTTGGT 84 0 

35 

CACCATTTCC TGCTACACCA GAGGTGGCCG AATCATCTCC ACTTCTTCGC GTTCGGTGAT 9 00 

GCTGCATTAC CGCTCAGACC TGCTCCAGAT GCTGGACACA CTGGTCTTCT CTAGCCTCCT 9 60 

40 GCTATTTGGC TTTGCAGAGC AGAAGCAGCT GCTGGAGGTG GAACTCTACG CAGACTATAG 1020 

AGAGAACTCG TACGTGCCGA CCACTGGAGC GATCATTGAG ATCCACAGCA AGCGCATCCA 10 80 

GCTGTATGGA GCCTACCTCC GCATCCACGC GCACTTCACT GGGCTCAGAT ACCTGCTATA 1140 

45 

CAACTTCCCG ATGACCTGCG CCTTCATAGG TGTTGCCAGC AACTTCACCT TCCTCAGCGT 1200 

CATCGTGCTC TTCAGCTACA TGCAGTGGGT GTGGGGGGGC ATCTGGCCCC GACACCGCTT 12 60 

50 CTCTTTGCAG GTTAACATCC GAAAAAGAGA. CAATTCCCGG AAGGAAGTCC AACGAAGGAT 13 2 0 

CTCTGCTCAT CAGCCAGGGC CTGAAGGCCA GGAGGAGTCA ACTCCGCAAT CAGATGTTAC 13 80 

AGAGGATGGT GAGAGCCCTG AAGATCCCTC AGGGACAGAG GGTCAGCTGT CCGAGGAGGA 144 0 

55 
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GAAACCAGAT CAGCAGCCCC TGAGCGGAGA AGAGGAGCTA GAGCCTGAGG CCAGTGATGG 15 0 0 

TTCAGGCTCC TGGGAAGATG CAGCTTTGCT GACGGAGGCC AACCTGCCTG CTCCTGCTCC 15 6 0 

5 TGCTTCTGCT TCTGCCCCTG TCCTAGAGAC TCTGGGCAGC TCTGAACCTG CTGGGGGTGC 162 0 

TCTCCGACAG CGCCCCACCT GCTCTAGTTC CTGAAGAAAA GGGGCAGACT CCTCACATTC 16 8 0 

CAGCACTTTC CCACCTGACT CCTCTCCCCT CGTTTTTCCT TCAATAAACT ATTTTGTGTC 17 40 

10 

.OuAAAAAAAAA AAAAA 1755 
(2) INFORMATION FOR SEQ ID NO : 2 : 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 2 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: procein 



25 . - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ser Thr Glu Lys Val Asp Gin Lys Glu Glu Ala Gly Glu Lys Glu 
15 10 15 

30 

Val Cys Gly Asp Gin lie Lys Gly Pro Asp Lys Glu Glu Glu Pro Pro 
20 25 30 

Ala Ala Ala Ser His Gly Gin Gly Trp Arg Pro Gly Gly Arg Ala Ala 
35 35 40 45 

Arg Asn Ala Arg Pro Glu Pro Gly Ala Arg His Pro Ala Leu Pro Ala 
50 55 60 

4 0 Met Val Asn Asp Pro Pro Val Pro Ala Leu Leu Trp Ala Gin Glu Val 

65 70 75 80 



45 



Gly Gin Val Leu Ala Gly Arg Ala Arg Arg Leu Leu Leu Gin Phe Gly 
85 90 95 

Val Leu Phe Cys Thr He Leu Leu Leu Leu Trp Val Ser Val Phe Leu 
100 105 110 



Tyr Gly Ser Phe Tyr Tyr Ser Tyr Met Pro Thr Val Ser His Leu Ser 

50 lis 120 125 

Pro Val His Phe Tyr Tyr Arg Thr Asp Cys Asp Ser Ser Thr Thr Ser 
130 135 140 

55 Leu Cys Ser Phe Pro Val Ala Asn Val Ser Leu Thr Lys Gly Gly Arg 
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145 150 155 160 

Asp Arg Val Leu Met Tyr Gly Gin Pro Tyr Arg Val Thr Leu Glu Leu 
155 170 175 

Glu Leu Pro Glu Ser Pro Val Asn Gin Asp Leu Gly Met Phe Leu Val 
180 135 190 

Thr lie Ser Cys 'Tyr Thr Arg Gly Gly Arg lie lie Ser Thr Ser Ser 
195 200 205 

Arg Ser Val Met Leu His Tyr Arg Ser Asp Leu Leu Gin Met Leu Asp 
210 215 220 

Thr Leu Val Phe Ser Ser Leu Leu Leu Phe Gly Phe Ala Glu Gin Lys 
225 230 235 240 

Gin Leu Leu Glu Val Glu Leu Tyr Ala Asp Tyr Arg Glu Asn Ser Tyr 
245 250 255 

Val Pro Thr Thr Gly Ala lie lie Glu He His- Ser Lys Arg lie Gin 
260 265 270 

Leu Tyr Gly Ala Tyr Leu Arg lie His Ala His Phe Thr Gly Leu Arg 
275 280 285 

Tyr Leu Leu Tyr Asn Phe Pro Met Thr Cys Ala Phe He Gly Val Ala 
290 295 300 

Ser Asn Phe Thr Phe Leu Ser Val He Val Leu Phe Ser Tyr Met Gin 
305 310 315 320 

Trp Val Trp Gly Gly He Trp Pro Arg His Arg Phe Ser Leu Gin Val 
325 330 335 

Asn He Arg Lys Arg Asp Asn Ser Arg Lys Glu Val Gin Arg Arg He 
340 345 350 

Ser Ala His Gin Pro "Gly Pro Glu Gly Gin Glu Glu Ser Thr Pro Gin 
355 360 355 

Ser Asp Val Thr Glu Asp Gly Glu Ser Pro Glu Asp Pro Ser Gly Thr 
370 . 375 380 

Glu Gly Gin Leu Ser Glu Glu Glu Lys Pro Asp Gin Gin Pro Leu Ser 
335 390 395 400 

Gly Glu Glu Glu Leu Glu Pro Glu Ala Ser Asp Gly Ser Gly Ser Trp 
405 410 415 

Glu Asp Ala Ala Leu Leu Thr Glu Ala Asn Leu Pro Ala Pro Ala Pro 
420 425 430 

Ala Ser Ala Ser Ala Pro Val Leu Glu Thr Leu Gly Ser Ser Glu Pro 
435 440 445 
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Ala Giy Gly Ala Leu Arg Gin Arg Pro Thr Cys Ser Ser Ser 
450 455 460 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GGAATAGAGG ATTTCAAAAA GCATGCGTTT TTTGAAGGTC TAAATTGGGA AAATATACGA 60 

AACCTAGAAG CACCTTATAT TCCTGATGTG AGCAGTCCCT CTGACACATC CAACTTCGAC 120 

GTGGATGACG ACGTGCTGAG AAACACGGAA ATATTACCTC CTGGTTCTCA CACAGGCTTT 180 
TCTGGATTAC ATTTGCCATT CATTGGTTTT ACATTCACAA CGGAAAGCTG TTTTTCTGAT - 2 4.0 

CGAGGCTCTC TGAAGAGCAT AATGCAGTCC AACACATT.^A CCAAAGATGA GGATGTGCAG 3 00 

CGGGACCTGG AGCACAGCCT GCAGATGGAA GCTTACGAGA GGAGGATTCG GAGGCTGGAA 3 60 

CAGGAGAAGC TGGAGCTGAG CAGGAAGCTG CAAGAGTCCA CCCAGACCGT GCAGTCCCTC 4 20 

CACGGCTCAT CTCGGGCCCT CAGCAATTCA AACCGAGATA AAGAAATCAA AAAGCTAAAT 4 80 

GAAGAAATCG AACGCTTGAA GAATAAAATA GCAGATTCAA ACAGGCTGGA GCGACAGCTT 540 

GAGGACACAG TGGCGCTTCG CCAAGAGCGT GAGGACTCCA CGCAGCGGCT GCGGGGGCTG 6 00 

GAGAAGCAGC ACCGCGTGGT CCGGCAGGAG AAGGAGGAGC TGCACAAGCA ACTGGTTGAA 660 

GCCTCAGAGC GGTTGAAATC CCAGGCCAAG GAACTCAAAG ATGCCCATCA GCAGCGAAAG 720 

CTGGCCCTGC AGGAGTTCTC GGAGCTGAAC GAGCGCATGG CAGAGCTCCG TGCCCAGAAG 7 80 

CAGAAGGTGT CCCGGCAGCT GCGAGACAAG GAGGAGGAGA TGGAGGTGGC CACGCAGAAG 340 

GTGGACGCCA TGCGGCAGGA AATGCGGAGA GCTGAGAAGC TCAGGAAAGA GCTGGAAGCT 900 
CAGCTTGATG ATGCTGTTGC TGAGGCCTCC AAGGAGCGCA AGCTTCGTGA GCACAGCGAG . 960 

AACTTCTGCA AGCAAATGGA AAGCGAGCTG GAGGCCCTCA AGGTGAAGCA AGGAGGCCGG 1020 

GGAGCGGGTG CCACCTTAGA GCACCAGCAA GAGATTTCCA AAATCAAATC CGAGCTGGAG 108 0 

AAGAAAGTCT TATTTTATGA AGAGGAATTG GTCAGACGTG AGGCCTCCCA TGTGCTAGAA 1140 
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GTGAAAAATG TGAAGAAGGA GGTGCATGAT 
GAAATCTTGA TGTTAAAAGA TAAGTTAGAA 
5 GAGGAGGCAG TAGGTACAAT AAAAGATAAA 
GAAAACAAGA AGCTAACTGC TGAAAATGAA 
GCTCAAAATA GACAGCTGGA GGATGAGCTG 

10 

GCCCACTGGG AAGCTCAGAT TGCGGAAATC 
CGGGGTTACC TTCAAGCTCT TGCTTCCAAG 
15 TCTAGTCTGG GGTCAAGAAC ACTGGACCCG 
GACATGTCCG CGCGGCTGGA GCTGCAGTCG 
CTTGTCCAGG AGGAGCTCAG GAAGGTCAAG 

20 

AWGGATTCCG AAGCCAAAAA CAGAGAATTA 
ATGGAAGAAA AATTCAGAGC AGATACTGGG 

2 5 TTTGAGTATT TCAACACTGC TCCTCTTGCA 

AGTGAGCAAG AAACACAAGC TCCGAAGCCA 
TCAGAGCAGC AGGAGGACAT GGCTCGGCCC 

30 

ACCACGCAGG CCCTGGCTCT GGCTGGACCG 
TCCTTCTCCA GCCCTACTCA GTGCAGCCAC 

3 5 CAGGGCTACG CCTGCGAGGT GTGTTCCTTT 

CCCCAGGTGT GCCCAATACC TCCCGAGCAG 
CGAGGCATCG GAACAGCCTA CAAAGGCCAT 

40 

AAGGGATGGC AGCGCGCATA TGCAGTCGTC 
CCTGAAGGAA AATCCACCCA GCCTGGTGTC 

4 5 GACGAGTTTT CCGTGAGCTC AGTCCTGGCC 

ATTCCATGTA TATTCAGGGT GACGGCCTCT 
CTGCTCATTC TGACAGAAAA. TGAGAATGAA 

50 

CTCCAGTCCA TCCTTCATAA AAACCGGCTG 
GCCTACGACA GCTCGCTGCC TCTCATCAAG 
55 GACAGGATTG CAGTCGGCCT AGAAGAAGGG 



TCAGAAAGCC ACCAGCTGGC CCTGCAGAAA 1200 

AAGTC.oAAGC GAGAACGGCA TAACGAGATG 1260 

TACGAACGAG AAAGAGCGAT GCTGTTTGAT 1320 

AAGCTCTGTT CCTTTGTGGA TAAACTCACA 13 80 

CAGGATCTGG CAGCCAAGAA GGAGTCAGTG 144 0 

ATTCAGTGGG TCAGTGACGA GAAAGATGCC 15 0 0 

ATGACCGAAG AGCTCGAGGC TTTGAGGAGT 15 60 

CTGTGGAAGG TGCGCCGCAG CCAGAAGCTG 162 0 

GCCCTGGAGG CGGAGATCCG GGCCAAGCAG 16 30 

GACGCCAACC TCACCTTGGA AAGCAAACYA 17 40 

TTAGAAGAAA TGGAAATTTT GAAGAAAAAG 18 00 

CTCAAACTTC CAGATTTTCA GGATTCCATT 18 60 

CATGACCTGA CATTTAGAAC CAGCTCAGCT ^1920 

GAAGCGTCCC CGTCGATGTC TGTGGCTGCA 1980 

CCGCAGAGGC CATCCGCTGT GCCGTTGCCC 2 040 

AAGCCAAAAG CTCACCAGTT CAGCATCAAG 210 0 

TGCACCTCCC TGATGGTTGG GCTGATCCGG 2160 

GCTTGCCACG TGTCCTGCAA AGACGGTGCC 222 0 

TCCAAGAGGC CTCTGGGCGT GGACGTGCAG 22 8 0 

GTCAAGGTCC CAAAGCCCAC GGGGGTGAAG 23.40 

TGTGACTGCA AGCTCTTCCT GTATGATCTG 24 00 

ATTGCGAGCC AAGTCTTGGA TCTCAGAGAT 24 6 0 

TCAGATGTCA TTCATGCTAC ACGCCGAGAT 252 0 

CTCTTAGGTG CACCTTCTAA GACCAGCTCG 258 0 

AAGAGGAAGT GGGTTGGGAT TCTAGAAGGA 2 640 

AGGAATCAGG TCGTGCATGT TCCCTTGGAA 27 00 

GCCATCCTGA CAGCTGCCAT CGTGGATGCA 27 60 

CTCTATGTCA TAGAGGTCAC CCGAGATGTG 2 820 
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ATCGTCCGTG CCGCTGACTG TAAGAAGGTA CACCAGATCG AGCTTGCTCC CAGGGAGAAG 288 0 

ATCGTAATCC TCCTCTGTGG CCGGAACCAC CATGTGCACC TCTATCCGTG GTCGTCCCTT 294 0 

5 GATGGAGCGG AAGGCAGCTT TGACATCAAG CTTCCGGAAA CCAAAGGCTG CCAGCTCATG 3 00 0 

GCCACGGCCA CACTCAAGAG GARCTCTGGC ACCTGCCTGT TTGTGGCCGT GAAACGGCTG 3 060 

ATCCTTTGCT ATGAGATCCA GAAAATAAAG CCATATTGAA TGATAAAAAA AAAAAAAAAA 312 0 

10 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA .AA-AAAAAAAA AAAAAAAAAA AAAAAAAAAA 3180 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 3213 
15 (2) INrORMATICN FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 5 amino acids 

(B) TYPE: amino acid 
2 0 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



25 



35 



50 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



3 0 Met Gin Ser Asn Thr Leu Thr Lys Asp Glu Asp Val Gin Arg Asp Leu 

1 5 10 15 



Glu His Ser Leu Gin Met Glu Ala Tyr Glu Arg Arg lie Arg Arg Leu 

20 25 30 

Glu Gin Glu Lys Leu Glu Leu Ser Arg Lys Leu Gin Glu Ser Thr Gin 
35 40 45 



Thr Val Gin Ser Leu His Gly Ser Ser Arg Ala Leu Ser Asn Ser Asn 
40 50 55 60 

Arg Asp Lys Glu lie Lys Lys Leu Asn Glu Glu lie Glu Arg Leu Lys 

65 70 75 80 

4 5 Asn Lys He Ala Asp Ser Asn Arg Leu Glu Arg Gin Leu Glu Asp Thr 

85 90 95 



Val Ala Leu Arg Gin Glu Arg Glu Asp Ser Thr Gin Arg Leu Arg Gly 
- 100 105 110 

Leu Glu Lys Gin His Arg Val Val Arg Gin Glu Lys Glu Glu Leu His 
115 120 125 



Lys Gin Leu Val Glu Ala Ser Glu Arg Leu Lys Ser Gin Ala Lys Glu 
55 130 135 140 
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Leu Lys Asp Ala Kis Gin Gin Arg Lys Leu Ala Leu Gin Glu Phe Ser 
145 150 155 160 

Glu Leu Asn Glu Arg Met Ala Glu Leu Arg Ala Gin Lys Gin Lys Val 
5 165 170 175 

Ser Arg Gin Leu Arg Asp Lys Glu Glu Glu Mec Glu Val Ala Thr Gin 
180 185 190 

10 Lys Val Asp Ala Met Arg Gin Glu Met Arg Arg Ala Glu Lys Leu Arg 

195 200 205 



15 



30 



45 



Lys Glu Leu Glu Ala Gin Leu Asp Asp Ala Val Ala Glu Ala Ser Lys 
210 215 220 

Glu Arg Lys Leu Arg Glu His Ser Glu Asn Phe Cys Lys Gin Met Glu 

225 230 235 240 



Ser Glu Leu Glu Ala Leu Lys Val Lys Gin Gly Gly Arg Gly Ala Gly 

20 245 250 255 

Ala Thr Leu Glu His Gin Gin Glu lie Ser Lys lie Lys Ser Glu Leu 

250 265 270 

2 5 Glu Lys Lys Val . Leu Phe Tyr Glu Glu Glu Leu Val Arg Arg Glu Ala 

275 280 " 285 



Ser His Val Leu Glu Val Lys Asn Val Lys Lys Glu Val His Asp Ser 

290 ' 295 300 

Glu Ser His Gin Leu Ala Leu Gin Lys Glu lie Leu Met Leu Lys Asp 

305 310 315 320 



Lys Leu Glu Lys Ser Lys Arg Glu Arg His Asn Glu Met Glu Glu Ala 
35 325 330 335 

Val Gly Thr lie Lys Asp Lys Tyr Glu Arg Glu Arg Ala Met Leu Phe 
340 345 350 

40 Asp Glu Asn Lys Lys Leu Thr Ala Glu Asn Glu Lys Leu Cys Ser Phe 

355 360 365 

Val Asp Lys Leu Thr Ala Gin Asn Arg Gin Leu Glu Asp Glu Leu Gin 
370 375 380 



Asp Leu Ala Ala Lys Lys Glu Ser Val Ala His Trp Glu Ala Gin lie 
. 385 390 395 400 



Ala Glu lie lie Gin Trp Val Ser Asp Glu Lys Asp Ala Arg Gly Tyr 
50 405 410 415 

Leu Gin Ala Leu Ala Ser Lys Met Thr Glu. Glu Leu Glu Ala Leu Arg 
420 425 430 

55 Ser Ser Ser Leu Gly Ser Arg Thr Leu Asp Pro Leu Trp Lys Val Arg 
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435 440 445 

Arg Ser Gin Lys Leu Asp Met Ser Ala Arg Leu Glu Leu Gin Ser Ala 
450 455 460 

Leu Glu Ala Glu He Arg Ala Lys Gin Leu Val Gin Glu Glu Leu Arg 
465 470 475 480 

Lys Val Lys Asp Ala Asn Leu Thr Leu Glu Ser Lys Xaa Xaa Asp Ser 
10 485 490 495 

Glu Ala Lys Asn Arg Glu Leu Leu Glu Glu Mec Glu He Leu Lys Lys 
500 505 510 

15 Lys Met Glu Glu Lys Phe Arg Ala Asp Thr Gly Leu Lys Leu Pro Asp 

515 520 525 



20 



35 



50 



Phe Gin Asp Ser He Phe Glu Tyr Phe Asn Thr Ala Pro Leu Ala His 

530 ' 535 540 

Asp Leu Thr Phe Arg Thr Ser Ser Ala Ser Glu- Gin Glu Thr Gin Ala 

545 550 555 560 



Pro Lys Pro Glu Ala Ser Pro Ser Met Ser Val Ala Ala Ser Glu Gin 
-25 565 570 575. 

Gin Glu Asp Met Ala Arg Pro Pro Gin Arg Pro Ser Ala Val Pro Leu 
580 585 590 

3 0 Pro Thr Thr Gin Ala Leu Ala Leu Ala Gly Pro Lys Pro Lys Ala His 

595 600 605 



Gin Phe Ser He Lys Ser Phe Ser Ser Pro Thr Gin Cys Ser His Cys 

610 615 620 

Thr Ser Leu Met Val Gly Leu He Arg Gin Gly Tyr Ala Cys Glu Val 

625 630 635 640 



Cys Ser Phe Ala Cys His Val Ser Cys Lys Asp Gly Ala Pro Gin Val 
40 645 650 655 

Cys Pro He Pro Pro Glu Gin Ser Lys Arg Pro Leu Gly Val Asp Val 
660 665 670 

45 Gin Arg Gly He Gly Thr Ala Tyr Lys Gly His Val Lys Val Pro Lys 

675 680 635 

Pro Thr Gly Val Lys Lys Gly Trp Gin Arg Ala Tyr Ala Val Val Cys 
690 ■ 695 700 



Asp Cys Lys Leu Phe Leu Tyr Asp Leu Pro Glu Gly Lys Ser Thr Gin 
705 710 715 720 



Pro aiy Val He Ala Ser Gin Val Leu Asp Leu Arg Asp Asp Glu Phe 
55 725 730 735 
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Ser Val Ser Ser Val Leu Ala Ser Asp Val lie His Ala Thr Arg Arg 
740 745 750 

Asp lie Pro Cys lie Phe Arg Val Thr Ala Ser Leu Leu Gly Ala Pro 

5 755 760 765 

Ser Lys Thr Ser Ser Leu Leu lie Leu Thr Glu Asn Glu Asn Glu Lys 
770 775 780 

10 Arg Lys Trp Val Gly lie Leu Glu Gly Leu Gin Ser lie Leu His Lys 

785 790 795 800 



15 



30 



50 



55 



Asn Arg Leu Arg Asn Gin Val Val His Val Pro Leu Glu Ala Tyr Asp 
805 810 815 

Ser Ser Leu Pro Leu lie Lys Ala lie Leu Thr Ala Ala lie Val Asp 
820 825 830 



Ala Asp Arg lie Ala Val Gly Leu Glu Glu Gly Leu Tyr Val lie Glu 

20 835 840 845 

Val Thr Arg Asp Val lie Val Arg Ala Ala Asp Cys Lys Lys Val His 

850 355 860 

25 Gin lie Glu Leu Ala Pro Arg Glu Lys lie Val lie -Leu Leu Cys Gly. 

865 870 375 880 



Arg Asn His His Val His Leu Tyr Pro Trp Ser Ser Leu Asp Gly Ala 

885 890 895 

Glu Gly Ser Phe Asp lie Lys Leu Pro Glu Thr Lys Gly Cys Gin Leu 

900 905 910 



Met Ala Thr Ala Thr Leu Lys Arg Xaa Ser Gly Thr Cys Leu Phe Val 
35 915 920 925 

Ala Val Lys Arg Leu lie Leu Cys Tyr Glu lie Gin Lys lie Lys Pro 
930 935 940 

4 0 Tyr 
945 

(2) INFORMATION FOR SEQ ID NO : 5 : 

4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1315 base pairs 
(3) TYPE: nucleic acid 

(C) STRAND EDNES S : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) SSQUHNCS DESCRIPTION: SEQ ID NO : 5 : 

GAGGGCACTT AATCCCAATG AACTGTATGC TTAAAAATAA TTTAAATGAT AAACTTTGTG 60 

TTATGTATAC TTTACCACAA TAAGAAAAA.G TATTTTAGTA CTAGTGGTAA ATAGTTTTTA 120 

TTTAATAGAC TTATATTTTA AAGCTTAAAA ATAATTTAGC TTCTAGAGTA TTACGTTTTT 130 

CTTCATGGGA ACTTCAAAAA GCAAGTCACT AAATCCAAGA ATTTTAAAGA AAAAACCCAA 2 40 

ATACATGATT TATGCTGCAT CTGGTATAGA TTTTTAAAAG ACTAGTCAAT CTAAGCTCTA 3 DC 

AACTATTAAA TGACAJiLACCA TTTCATATGT CATTGCATAT TCCTATGTAC CACATTCTCA 3 60 

TATTTCTGTT ATGGGCATGA AGGGGTGTTT GATGCTTCCA TGCCATAATA ACCATGACTA 420 

TCACAACCAT TGAAATAAAG GTTCTTGCAG TATTTTCAGG ATGGTCCCAG AAATTTAAAT 4 80 

TAATCTCTCA TCCATTGGCT TTTGCTACTT TAGGTTAATA TTAAAATATA ACATACATTT 54 0 

TTGGGGTTTA TGCTGTTAGC TCCAAACCAA AAGATTTTGG AAATTTATTT TGGAAATTTT 500 

GTGTTTAGAA TATGAATAAA TCTGCTTATT CAGAAAAATT AAACCTTGAT AACTTGGGAC 660 

CTCCTATTCC TGTATGTTCT CTGACATACA TTGAGGGATT TGGCTCTCTT TTGTTTATTT - 720 

GTTTTACTAG TCAGACATTC CTTTGGCTGC CCATACTTAA TTCTGTTGGG TGTTTCCGCC 780 

CCCGCCCTCA GCTTCTGCAG CTACTCTGAT CAACATCCGC AATGCCAGGA AACACTTTGA 840 

AAAGCTGGAA AGAGTGGATG GACCAAAGCA GTGTCTTCTC ATGCGCTAAA CATTGATGAA 9 00 

TATTGTTTCA CAGAAAAATT AAAAGTTTCC TAATTAATGT TGTATTCATA TATGTAGGCT 9 60 

CTGAAATGTT GTGATGCTTA TTGCTTCTGT ATTTCTTCTC TACTCCCTAG TCTTAATGTT 102 0 

TAACCTTGAA TGCTATTAAC TTAAATAGCC ATTGAGGAGT TAGAAGATGA ATTGTTCATG 10 80 

AAGTCGGTGT TACATAAAAG TAGGTGATAT GTAAGTTTTC TGATAACAAG GTTCTAATAG 114 0 

TGTTTAAATG TACTGGTAAC CTGGTTCCAA TAGTTGTGTT TGCCCAAGCC TTTCTCGGCA 120 0 

TCATCTTGTA TTCCTTATCA GATAGTAAGT AACCTGTAAG TTTGGAGTAT TACTGTTTTC 12 50 

TCAGCATGCA TTAAAAATAT TCCTTAACTT CAATTGTAAA AAAAAAAAAA AAAAA 1315 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: - ' 

(A) LENGTH: .6 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asn Lys Sex Ala Tyr Ser Glu Lys Leu Asn Leu Asp Asn Leu Gly 
15 10 15 

Pro Pro lie Pro Val Cys Ser Leu Thr Tyr lie Glu Gly Phe Gly Ser 
20 25 30 

Leu Leu Phe lie Cys Phe Thr Ser Gin Thr Phe Leu Trp Leu Pro lie 
35 40 45 

Leu Asn Ser Val Gly Cys Phe Arg Pro Arg Pro Gin Leu Leu Gin Leu 
50 55 50 

Leu 
65 

(2) INFOFtMATION rOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

TAGGCCATGA AGGCCGAATC GGCCTTCATG GCCTACGCTT ACACAATACC CACCATGTCC 60 

CAGGCTGGTG CTCAGGAAGC CCCTATCAAG AAGAAGCGCC CCCCTGTGAA GGAGGAGGAC 120 

CTGAAGGGGG CCCGAGGAAA CCTGACCAAG AACCAGGAAA TCAAGTCCAA GACCTACCAG 180 

GTCATGCGAG AGTGTGAGCA AGCTGGCTCG GCCGCCCCGT CGGTGTTCAG CCGCACCCGC 240 

ACAGGTACCG AGACTGTCTT TGAGAAGCCC AAAGCCGGAC CCACCAAGAG TGTCTTCGGC 3 00 

TGAGAAGTGT GCGCCACTCC CCTTGCTGCC CGAATGCTCG GAAACAGGAG CCTTACCCAG 360 

GAACTCTTTT TTATGCCAGA ACGCTTCCTC TCCCCTGCTG TCTCTGGGGC TGCCACCCTC 42 0 

CCCCACAGTC CAGGCCCTTC AGCCAAGGGC TCTGCACCAG CACCTTGGAA GCACCAATAA 4 80 

AGAGGATGCC CACGTGGCCC CAGCAAAAAA AAAAAAAAA 519 
(2) INFORMATION FOR SEQ ID NO:8: 
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(i) SEQU£NCE CHARACTERISTICS: 

(A) LENGTH: 9 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

5 (D) TOPOLOGY: linear 

(ii) MOLECULE- TYPE : protein 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



Met Lys Ala Glu Ser Ala Phe Met Ala Tyr Ala Tyr Thr lie Pro Thr 
15 1 5 10 " 15 

Met Ser Gin Ala Gly Ala Gin Glu Ala Pro lie Lys Lys Lys Arg Pro 
20 25 30 

2 0 Pro Val Lys Glu Glu Asp Leu Lys Gly Ala Arg Gly Asn Leu Thr Lys 

35 40 - 45 



25 



Asn Gin Glu lie Lys Ser Lys Thr Tyr Gin Val Met Arg Glu Cys Glu 

50 .55 60 . 

Gin Ala Gly Ser Ala Ala Pro Ser Val Phe Ser Arg Thr Arg Thr Gly 
55 70 75 80 



Thr Glu Thr Val Phe Glu Lys Pro Lys Ala Gly Pro Thr Lys Ser Val 
30 85 90 95 

Phe Gly 

3 5 (2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2788 base pairs 
(3) TYPE: nucleic acid 
40 (C) STRANDEDNESS: double' 

(D) TOPOLOGY; linear 



(ii) MOLECULE TYPE: cDNA 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
50 GACGGCGACC AAACCCAGCT AGGTCAGACG AGAAAGATAA AAACTCTCCA GATGTCTTCC 60 
AGTAATGTCG AAGTTTTTAT CCCAGTGTCA CAAGGAAACA CCAATGGCTT CCCCGCGACA .120 
GCTTCCAATG ACCTGAAGGC ATTTACTGAA GGAGCTGTGT" TAAGTTTTCA TAACATCTGC 180 

55 
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TATCGAGTAA AACTGAAGAG TGGCTTTCTA 
TTATCGAATA TCAATGGGAT CATGAAACCT 
5 GGAHGCAAAT CTTCGTTATT AGATGTCTTA 
GGAGATGTTC TGATAAATGG AGCACCGCGA 
GTGGTACAAG TTGGAACTCA GTTTATCCGT 

10 

AGTATAGGAA TGGAGCTTAT CACTGATCCT 
GGCTTAGACT CAAGCACAGC AAATGCTGTC 
15 GGACGAACAA TCATCTTCTC CATTCATCAG 
AGCCTCACCT TATTGGCCTC AGGAAGACTT 
GGATACTTTG AATCAGCTGG TTATCACTGT 

20 

TTGGACATCA TTAATGGAGA TTCCACTGCT 
GCCACAGAGA TCATAGAGCC TTCCAAGCAG 

2 5 ATTTATGTCA ACTCCTGCTT CTACAAAGAG 

GGTGAGAAGA AGAAGAAGAT CACAGTCTTC 
CATCAACTCA GATGGGTTTC CAAGCGTTCA 

30 

TCTATAGCTC AGATCATTGT CACAGTCGTA 
GGGCTAAAAA ATGATTCTAC TGGAATCCAG 

3 5 ACCAACCAGT GTTTCAGCAG TGTTTCAGCC 

TTCATACATG AATACATCAG CGGATACTAC 
TTATCTGATT TATTACCCAT GAGGATGTTA 

40 

TTCATGTTAG GATTGAAGCC AAAGGCAGAT 
ATGGTGGCTT ATTCAGCCAG TTCCATGGCA 

45 TCTGTAGCAA CACTTCTCAT GACCATCTGT 
TTGiSTCAATC TCACAACCAT TGCATCTTGG 
CGATATGGAT TTACGGCTTT GCAGCATAAT 
CTCAATGCAA CAGGAAACAA TCCTTGTAAC 
GTAAAGCAGG GCATCGATCT CTCACCCTGG 

55 TGTATGATTG TTATTTTCCT CACAATTGCC 



CCTTGTCGAA AACCAGTTGA GAAAGAAATA 2 40 

GGTCTCAACG CCATCCTGGG ACCCACAGGT 3 00 

GCTGCAAGGA AAGATCC.AAG TGGATTATCT 3 50 

CCTGCCAATT TCA-\ATGTAA TTCAGGTTAC 420 
GGTGTGTCTG GAGGAGAA-AG AAAAAGGACT 480 
TCCATCTTGT TCTTGGATGA GCCTACAACT 540 
CTTTTGCTCC TGAAAAGGAT GTCTAAGCAG 500 
CCTCGATATT CCATCTTCAA GTTGTTTGAT 660 
ATGTTCCACG GGCCTGCTCA GGAGGCCTTG 72 0 

GAGGCCTATA ATAACCCTGC AGACTTCTTC 7 80 

GTGGCATTAA ACAGAGAAGA AGACTTTAAA 840 
GATAAGCCAC TCATAGAAAA ATTAGCGGAG 9 00 

ACAJiAAGCTG AATTACATCA ACTTTCCGGG 9 60 

AAGGAGATCA GCTACACCAC CTCCTTCTGT 102 0 

TTCAAAAACT TGCTGGGTAA TCCCCAGGCC 10 8 0 

CTGGGACTGG TTATAGGTGC CATTTACTTT 1140 

AACAGAGCTG GGGTTCTCTT CTTCCTGACG 12 00 

GTGGAACTCT TTGTGGTAGA GAAGAAGCTC 12 60 

AGAGTGTCAT CTTATTTCCT TGGAAAACTG 13 20 

CCAAGTATTA TATTTACCTG TATAGTGTAC 13 30 

GCCTTCTTCG TTATGATGTT TACCCTTATG 14 40 

CTGGCCATAG CAGCAGGTCA GAGTGTGGTT 1500 

TTTGTGTTTA TGATGATTTT TTCAGGTCTG 1560 

CTGTCATGGC TTCAGTACTT CAGCATTCCA 162 0 

GAATTTTTGG GACAAAACTT CTGCCCAGGA 158 0 

TATGCAACAT GTACTGGCGA AGAATATTTG 174 0 

GGCTTGTGGA AGAATCACGT GGCCTTGGCT 1300 

TACCTGAAAT TGTTATTTCT TAAAAAATAT 136 0 
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TCTTAAATTT CCCCTTAATT CAGTATGATT TATCCTCACA TAAAAAAGAA GCACTTTGAT 192 0 

TGAAGTATTC AATCAAGTTT TTTTGGTTGT TTTCTGTTCC CTTGCCATCA CACTGTTGCA 198 0 

5 CAGCAGCAAT TGTTTTAAAG AGATACATTT TTAGAAATCA CAACAAACTG AATTAAACAT 204 0 

GAAAGAACCC AAGACATCAT GTATCGCATA TTAGTTAATC TCCTCAGACA GTAACCATGG 210 0 

GGAAGAAATC TGGTCTAATT TATTAATCTA AAAAAGGAGA ATTGAATTCT GGAAACTCCT 216 0 

10 

GACAAGTTAT TACTGTCTCT GGCATTTGTT TCCTCATCTT TAA-AATGAAT AGGTAGGTTA 22 20 

GTAGCCCTTC AGTCTTAATA CTTTATGATG CTATGGTTTG CCATTATTTA ATAAATGACA 22 80 

15 AATGTATTAA TGCTAAAAAA AAAAAAAAAA AGCGGCCTTC ATGGCCTAGA GATTTCAACT 2340 

TAACTTGACC GCTCTGAGCT AAACCTAGCC CCAAACCCAC TCCACCTTAT TACCAGACAA 2 4 00 

CCTTAACC^iJlL ACCATTTACC CAJlATAAAGT ATAGGCGATA GAAATTGAAA CCTGGCGCAA 2 4 60 

20 

TAGATATAGT ACCGCAAGGG AAAGATGAAA AATTATAACC AAGCATAATA TAGCAAGGAC 2 52 0 

TAACCCCTAT ACCTTCTGCA TAATGAATTA ACTAGAAATA ACTTTGCAAG GAGAGCCAAA 25 80 

2 5 GCTAAGACCC CCGAAACCAG ACGAGCTACC TAAGAACAGC TAAAAGAGCA CACCCGTCTA 2 640 

TGTAGCAAAA TAGTGGGAAG ATTTATAGGT AGAGGCGACA AACCTACCGA GCCTGGTGAT 2700 

AGCTGGTTGT CCCAGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2760 

30 

AAAAAAAAAA AAAAAAAAAA AAAAAAAA 2788 
■ (2) INFORMATION FOR SEQ ID NO: 10: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 04 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



40 



45 



50 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 0 : 



Met Ser Ser Ser Asn Val Glu Vai Phs lie Pro Val Ser Gin Gly Asn 

1 5 10 15 

Thr Asn Gly Phe Pro Ala Thr Ala Ser Asn Asp Leu Lys Ala Phe Thr 
20 25 30 

Glu Gly Ala Val Leu "Ser Phe His Asn lie Cys Tyr Arg Val Lys Leu 
55 35 40 45 
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(xi) SEQUENCE DESCRIPTION: SZQ ID NO: 5: 

GAGGGCACTT .QATCCCAATG AACTGTATGC TTAAAAATAA TTTAAATGAT AAACTTTGT3 60 

5 TTATGTATAC TTTACCACAA TAAGAAAA;i.G TATTTTAGTA CTAGTGGTAA ATAGTTTTTA 120 

TTTAATAGAC TTATATTTTA AAGCTTAAAA ATAATTTAGC TTCTAGAGTA TTACGTTTTT 130 

CTTCATGGGA ACTTCAAAAA GCAAGTCACT AAATCCAAGA ATTTTAAAGA AAAAACCCAA 2 40 

10 

ATACATGATT TATGCTGCAT CTGGTATAGA TTTTTAAAAG ACTAGTCAAT CTAAGCTCTA 3 00 

AACTATTAAA TGACAJ^lACCA TTTCATATGT CATTGCATAT TCCTATGTAC CACATTCTCA 3 60 

15 TATTTCTGTT ATGGGCATGA AGGGGTGTTT GATGCTTCCA TGCCATAATA ACCATGACTA 42 0 

TCACAACCAT TGAAATAAAG GTTCTTGCAG TATTTTCAGG ATGGTCCCAG AAATTTAAAT 480 

TAATCTCTCA TCCATTGGCT TTTGCTACTT TAGGTTAATA TTAAAATATA ACATACATTT 54 0 

20 

TTGGGGTTTA TGCTGTTAGC TCCAAACCAA AAGATTTTGG AAATTTATTT TGGAAATTTT 50 0 

GTGTTTAGAA TATGAATAAA TCTGCTTATT CAGAAAAATT AAACCTTGAT AACTTGGGAC 660 

2 5 CTCCTATTCC TGTATGTTCT CTGACATACA TTGAGGGATT TGGCTCTCTT TTGTTTATTT - 72 0 

GTTTTACTAG TCAGACATTC CTTTGGCTGC CCATACTTAA TTCTGTTGGG TGTTTCCGCC 780 

CCCGCCCTCA GCTTCTGCAG CTACTCTGAT CAACATCCGC AATGCCAGGA AACACTTTGA 840 

30 

AAAGCTGGAA AGAGTGGATG GACCAAAGCA GTGTCTTCTC ATGCGCTAAA CATTGATGAA 9 00 

TATTGTTTCA CAGAAAAATT AAAAGTTTCC TAATTAATGT TGTATTCATA TATGTAGGCT 9 60 

3 5 CTGAAATGTT GTGATGCTTA TTGCTTCTGT ATTTCTTCTC TACTCCCTAG TCTTAATGTT 102 0 

TAACCTTGAA TGCTATTAAC TTAAATAGCC ATTGAGGAGT TAGAAGATGA ATTGTTCATG 10 80 

AAGTCGGTGT TACATAAAAG TAGGTGATAT GTAAGTTTTC TGATAACAAG GTTCTAATAG 1140 

40 

TGTTTAAATG TACTGGTAAC CTGGTTCCAA TAGTTGTGTT TGCCCAAGCC TTTCTCGGCA 12 00 

TCATCTTGTA TTCCTTATCA GATAGTAAGT AACCTGTAAG TTTGGAGTAT TACTGTTTTC 12 50 

45 TCAGCATGCA TTAAAAATAT TCCTTAACTT CAATTGTAAA AAAAAAAAAA AAAAA 1315 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: - " 

50 (A) LENGTH: .6_5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Asn Lys Ser Ala Tyr Ser Glu Lys Leu Asn Leu Asp Asn Leu Gly 
15 10 15 

Pro Pro lie Pro Val Cys Ser Leu Thr Tyr lie Giu Gly Phe Gly Ser 
20 25 30 

Leu Leu Phe lie Cys Phe Thr Ser Gin Thr Phe leu Trp Leu Pro lie 
35 40 45 

Leu Asn Ser Val Gly Cys Phe Arg Pro Arg Pro Gin Leu Leu Gin Leu 
50 55 50 

Leu 
65 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 
(BJ TYPE: nucleic acid 
{ C ) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

TAGGCCATGA AGGCCGAATC GGCCTTCATG GCCTACGCTT ACACAATACC CACCATGTCC 60 

CAGGCTGGTG CTCAGGAAGC CCCTATCAAG AAGAAGCGCC CCCCTGTGAA GGAGGAGGAC 120 

CTGAAGGGGG CCCGAGGAAA CCTGACCAAG AACCAGGAAA TCAAGTCCAA GACCTACCAG 180 

GTCATGCGAG AGTGTGAGCA AGCTGGCTCG GCCGCCCCGT CGGTGTTCAG CCGCACCCGC 240 

ACAGGTACCG AGACTGTCTT TGAGAAGCCC AAAGCCGGAC CCACCAAGAG TGTCTTCGGC 3 00 

TGAGAAGTGT GCGCCACTCC CCTTGCTGCC CGAATGCTCG GAAACAGGAG CCTTACCCAG 3 60 

GAACTCTTTT TTATGCCAGA ACGCTTCCTC TCCCCTGCTG TCTCTGGGGC TGCCACCCTC 42 0 

CCCCACAGTC CAGGCCCTTC AGCCAAGGGC TCTGCACCAG CACCTTGGAA GCACCAATAA 4 80 

AGAGGATGCC CACGTGGCCC CAGCAAAAAA AAAAAAAAA - 519 
('2) INFORMATION FOR SEQ ID NO : 8 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 8 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : 

5 (D) TOPOLOGY: linear 

(ii) MOLECULE- TYPE: protein 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



Met Lys Ala Glu Ser Ala Phe Met Ala Tyr Ala Tyr Thr lie Fro Thr 
15 1 5 10 15 

Met Ser Gin Ala Gly Ala Gin Glu Ala Pro lie Lys Lys Lys Arg Pro 

20 25 30 

2 0 Pro Val Lys Glu Glu Asp Leu Lys Gly Ala Arg Gly Asn Leu Thr Lys 

35 40 - 45 



25 



Asn Gin Glu lie Lys Ser Lys Thr Tyr Gin Val Met Arg Glu Cys Glu 

50 ■ . 55 60 . 

Gin Ala Gly Ser Ala Ala Pro Ser Val Phe Ser Arg Thr Arg Thr Gly 

55 70 75 80 



Thr Glu Thr Val Phe Glu Lys Pro Lys Ala Gly Pro Thr Lys Ser Val 
30 85 90 95 

Phe Gly 

3 5 (2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2788 base pairs 
(3) TYPE: nucleic acid 

4 0 (C) STRANDEDNES S : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
50 GACGGCGACC AAACCCAGCT AGGTCAGACG AGAAAGATAA AAACTCTCCA GATGTCTTCC 60 
AGTAATGTCG AAGTTTTTAT CCCAGTGTCA CAAGGAAACA CCAATGGCTT CCCCGCGACA .120 
GCTTCCAATG ACCTGAAGGC ATTTACTGAA GGAGCTGTGT TAAGTTTTCA TAACATCTGC 180 

55 
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TATCGAGTAA AACTGAAGAG TGGCTTTCTA 
TTATCGAATA TCAATGGGAT CATGAAACCT 
5 GGARGCAAAT CTTCGTTATT AGATGTCTTA 
GGAGATGTTC TGATAAATGG AGCACCGCGA 
GTGGTACAAG TTGGAACTCA GTTTATCCGT 

10 

AGTATAGGAA TGGAGCTTAT CACTGATCCT 
GGCTTAGACT CAAGCACAGC AAATGCTGTC 
15 GGACGAACAA TCATCTTCTC CATTCATCAG 
AGCCTCACCT TATTGGCCTC AGGAAGACTT 
GGATACTTTG AATCAGCTGG TTATCACTGT 

20 

TTGGACATCA TTAATGGAGA TTCCACTGCT 
GCCACAGAGA TCATAGAGCC TTCCAAGCAG 

2 5 ATTTATGTCA ACTCCTCCTT CTACAAAGAG 

GGTGAGAAGA AGAAGAAGAT CACAGTCTTC 
CATCAACTCA GATGGGTTTC CAAGCGTTCA 

30 

TCTATAGCTC AGATCATTGT CACAGTCGTA 
GGGCTAAAAA ATGATTCTAC TGGAATCCAG 

3 5 ACCAACCAGT GTTTCAGCAG TGTTTCAGCC 

TTCATACATG AATACATCAG CGGATACTAC 
TTATCTGATT TATTACCCAT GAGGATGTTA 

40 

TTCATGTTAG GATTGAAGCC AAAGGCAGAT 
ATGGTGGCTT ATTCAGCCAG TTCCATGGCA 
45 TCTGTAGCAA CACTTCTCAT GACCATCTGT 
TTGCTCAATC TCACAACCAT TGCATCTTGG 
CGATATGGAT TTACGGCTTT GCAGCATAAT 

50 

CTCAATGCAA CAGGAAACAA *TCCTTGTAAC 
GTAAAGCAGG GCATCGATCT CTCACCCTGG 
55 TGTATGATTG TTATTTTCCT CACAATTGCC 



CCTTGTCGAA AACCAGTTGA GAAAGAAATA 2 40 

GGTCTCAACG CCATCCTGGG ACCCACAGGT 3 00 

GCTGCAAGGA AAGATCCAAG TGGATTATCT 3 SO 

CCTGCCAATT TCA-SATGTAA TTCAGGTTAC 420 

GGTGTGTCTG GAGGAGAAAG AAAAAGGACT 480 

TCCATCTTGT TCTTGGATGA GCCTACAACT 540 

CTTTTGCTCC TGAAAAGGAT GTCTAAGCAG 5 00 

CCTCGATATT CCATCTTCA.ZV GTTGTTTGAT 56 0 

ATGTTCCACG GGCCTGCTCA GGAGGCCTTG 72 0 

GAGGCCTATA ATAACCCTGC AGACTTCTTC 780 

GTGGCATTAA ACAGA"GAAGA AGACTTTAAA 84 0 

GATAAGCCAC TCATAGAAAA ATTAGCGGAG 900 

ACAAAAGCTG AATTACATCA ACTTTCCGGG 9 60 

AAGGAGATCA GCTACACCAC CTCCTTCTGT 102 0 

TTCAAAAACT TGCTGGGTAA TCCCCAGGCC 10 8 0 

CTGGGACTGG TTATAGGTGC CATTTACTTT 114 0 

AACAGAGCTG GGGTTCTCTT CTTCCTGACG 12 00 

GTGGAACTCT TTGTGGTAGA GAAGAAGCTC 12 60 

AGAGTGTCAT CTTATTTCCT TGGAAAACTG 13 20 

CCAAGTATTA TATTTACCTG TATAGTGTAC 13 30 

GCCTTCTTCG TTATGATGTT TACCCTTATG 14 40 

CTGGCCATAG CAGCAGGTCA GAGTGTGGTT 1500 

TTTGTGTTTA TGATGATTTT TTCAGGTCTG 1560 

CTGTCATGGC TTCAGTACTT CAGCATTCCA 162 0 

GAATTTTTGG GACAAAACTT CTGCCCAGGA 158 0 

TATGCAACAT GTACTGGCGA AGAATATTTG 174 0 

GGCTTGTGGA AGAATCACGT GGCCTTGGCT 130 0 

TACCTGAAAT TGTTATTTCT TAAAAAATAT 186 0 
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TCTTAAATTT CCCCTTAATT CAGTATGATT TATCCTCACA TAAAAAAGAA GCACTTTGAT 192 0 

TGAAGTATTC AATCAAGTTT TTTTGGTTGT TTTCTGTTCC CTTGCCATCA CACTGTTGCA 1980 

5 CAGCAGCAAT TGTTTTAAAG AGATACATTT TTAGAAATCA CAACAAP.CTG AATTAAACAT 2040 

GAAAGAACCC AAGACATCAT GTATCGCATA TTAGTTAATC TCCTCAGACA GTAACCATC^ 2100 

GGAAGAAATC TGGTCTAATT TATTAATCTA AAAAAGGAGA ATTGAATTCT GGAAACTCCT 216 0 

10 

GACAAGTTAT TACTGTCTCT GGCATTTGTT TCCTCATCTT TAAAATGAAT AGGTAGGTTA 222 0 

GTAGCCCTTC AGTCTTAATA CTTTATGATG CTATGGTTTG CCATTATTTA ATAAATGACA 22 8 0 

15 AATGTATTAA TGCTAAAAAA AAAAAAAAAA AGCGGCCTTC ATGGCCTAGA GATTTCAACT 23 40 

TAACTTGACC GCTCTGAGCT AAACCTAGCC CCAAACCCAC TCCACCTTAT TACCAGACAA 2 4 00 

CCTTAACCAA ACCATTTACC CAAATAAAGT ATAGGCGATA G^J^lATTGAAA CCTGGCGCAA 2 4 50 

2 0 ' . 

TAGATATAGT ACCGCAAGGG AAAGATGAAA AATTATA^iCC AAGCATA^TA TAGCAAGGAC 2 52 0 

TAACCCCTAT ACCTTCTGCA TAATGAATTA ACTAGAAATA ACTTTGCAAG GAGAGCCAAA 2 580 

25 GCTAAGACCC CCGAAACCAG ACGAGCTACC TAAGAACAGC TAAAAGAGCA CACCCGTCTA 2 640 

TGTAGCAAAA TAGTGGGAAG ATTTATAGGT AGAGGCGACA AACCTACCGA GCCTGGTGAT 270 0 

AGCTGGTTGT CCCAGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2760 

30 

AAAAAAAAAA AAAAAAAAAA AAAAAAAA 2788 
■ (2) INFORMATION FOR SEQ ID NO: 10: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 04 amino acids 
{ B) TYPE : amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: protein 



45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Ser Ser Asn Val Glu Val Phs lie Pro Val Ser Gin Gly Asn 
1 5 10 15 

50 _ 

Thr Asn Gly Phe Pro Ala Thr Ala Ser Asn Asp Leu Lys Ala Phe Thr 
20 25 30 

Glu Gly Ala Val Leu Ser Phe His Asn lie Cys Tyr Arg Val Lys Leu 
55 35 40 45 
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Lys Ser Gly Phe Leu Pro Cys Arg Lys Pro Val Glu Lys Glu lie Leu 
50 55 60 

Ser Asn lie Asn Giy He Met Lys Pro Gly Leu Asn Ala He Leu Gly 
65 70 75 80 

Pro Thr Gly Gly Xaa Lys Ser Ser Leu Leu Asp Val Leu Ala Ala Arg 
85 90 95 

Lys Asp Pro Ser Gly Leu Ser Gly Asp Val Leu He Asn Gly Ala Pro 
100 105 110 

Arg Pro Ala Asn Phe Lys Cys Asn Ser Gly Tyr Val Val Gin Val Giy 
115 120 125 

Thr Gin Phe He Arg Gly Val Ser Gly Gly Glu Arg Lys Arg Thr Ser 
130 135 140 

lie Gly Met Glu Leu He Thr Asp Pro Ser He Leu Phe Leu Asp Glu 
145 150 155 160 

Pro Thr Thr Gly Leu Asp Ser Ser Thr Ala Asn Ala Val Leu Leu Leu 
165 170 175 

Leu Lys Arg Met Ser Lys Gin Gly Arg Thr He He Phe Ser He His 
ISO " 185 190 

Gin Pro Arg Tyr Ser He Phe Lys Leu Phe Asp Ser Leu Thr Leu Leu 
195 200 205 

Ala Ser Gly Arg Leu Met Phe His Gly Pro Ala Gin Glu Ala Leu Gly 
210 215 220 

Tyr Phe Glu Ser Ala Gly Tyr His Cys Glu Ala Tyr Asn Asn Pro Ala 
225 230 235 240 

Asp Phe Phe Leu Asp He He Asn Gly Asp Ser Thr Ala Val Ala Leu 
245 250 255 

Asn Arg Glu Glu Asp Phe Lys Ala Thr Glu He He Glu Pro Ser Lys 
260 265 270 

Gin Asp Lys Pro Leu He Glu Lys Leu Ala Glu He Tyr Val Asn Ser 
275 280 285 

Ser Phe Tyr Lys Glu Thr Lys Ala Glu Leu His Gin Leu Ser Gly Gly 
290 295 300 

Glu Lys Lys Lys Lys He Thr Val Phe Lys Glu He Ser Tyr Thr Thr 
305 310 315 320 

Ser Phe Cys His Gin Leu Arg Trp Val Ser Lys Arg Ser Phe Lys Asn 
325 330 335 

Leu Leu Gly Asn Pro Gin Ala Ser He Ala Gin He He Val Thr Val 
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340 345 350 

Val Leu Gly Leu Val lie Gly Ala lie Tyr Phe Gly Leu Lys Asn Asp 
355 360 365 

Ser Thr Gly lie Gin Asn Arg Ala Gly Val Leu Phe Phe Leu Thr Thr 
370 375 380 

Asn Gin Cys Phe Ser Ser Val Ser Ala Val Glu Leu Phe Val Val Glu 
385 390 395 400 

Lys Lys Leu Phe lie His Glu Tyr lie Ser Gly Tyr Tyr Arg Val Ser 
405 410 415 

Ser Tyr Phe Leu Gly Lys Leu Leu Ser Asp Leu Leu Pro Met Arg Met 
420 425 430 

Leu Pro Ser lie Ila Phe Thr Cys lie Val Tyr Phe Met Leu Gly Leu 
435 440 445 

Lys Pro Lys Ala Asp Ala Phe Phe Val Met Met. Phe Thr Leu Met Met 
450 455 450 

Val Ala Tyr Ser Ala Ser Ser Met Ala Leu Ala lie Ala Ala Gly Gin 
465 470 - 475 480 

Ser Val Val Ser Val Ala Thr Leu Leu Met Thr lie Cys Phe Val Phe 
435 490 495 

Met Met lie Phe Ser Gly Leu Leu Val Asn Leu Thr Thr lie Ala Ser 
500 505 510 

Trp Leu Ser Trp Leu Gin Tyr Phe Ser lie Pro Arg Tyr Gly Phe Thr 
515 520 525 

Ala Leu Gin His Asn Glu Phe Leu Gly Gin Asn Phe Cys Pro Gly Leu 
530 535 540 

Asn Ala Thr Gly Asn Asn Pro Cys Asn Tyr Ala Thr Cys Thr Gly Glu 
545 550 555 560 

Glu Tyr Leu Val Lys Gin Gly lie Asp Leu Ser Pro Trp Gly Leu Trp 
565 570 575 

Lys Asn His Val Ala Leu Ala Cys Met lie Val lie Phe Leu Thr lie 
580 585 590 

Ala Tyr Leu Lys Leu Leu Phe Leu Lys Lys Tyr Ser 

595 600 " . ' 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2930 base pairs 

(B) TYPE: nucleic acid 
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(C ) STRAiJDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

<ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CGACTTCCTC GGCTGCGCGG CGCTCGCGCG GAGCTCCCCG GCCGGCGGTG CGTCCCCACG 60 

GTCACCATGA AAGACGACTT CGCAGAGGAG GAGGAGGTGC AATCCTTCGG TTACAAGCGG 120 

TTTGGTATTC AGGAAGGAAC AC.QiATGTACC AAATGTAAAA ATAACTGGGC ACTGAAGTTT 180 

TCTATCATAT TATTATACAT TTTGTGTGCC TTGCTAACAA TCACAGTAGC CATTTTGGGA 24 0 

TATAAAGTTG TAGAGAAAAT GGACAATGTC ACAGGTGGCA TGGAAACATC TCGCCAAACC 3 00 

TATGATGACA AGCTCACAGC AGTGGAAAGT GACCTGAA.IVA AATTASGTGA CCAAACTGGG 3 60 

AAGAAAGCTA TCAGCACCAA CTCAGAACTC TCCACCTTCA GATCAGACAT TCTAGATCTC 42 0 
CGTCAGCAAC TTCGTGAGAT TACAGAAAAA ACCAGCAAGA ACAAGGATAC GCTGGAGAAG . 4.80 

TTACAGGCGA GCGGGGATGC TCTGGTGGAC AGGCAGAGTC AATTGAAAGA AACTTTGGAG 540 

AATAACTCTT TCCTCATCAC CACTGTAAAC AAAACCCTCC AGGCGTATAA TGGCTATGTC 600 

ACGAATCTGC AGCAAGATAC CAGCGTGCTC CAGGGCAATC TGCAGAACCA AATGTATTCT 6 60 

CATAATGTGG TCATCATGAA CTCAACAACC TGAACCTGAC CCAGGTGCAG CAGAGGAACC 720 

TCATCACGAA TCTGCAGCGG TCTGTGGATG ACACAAGCCA GGCTATCCAG CGAATCAAGA 780 

ACGACTTTCA AAATCTGCAG CAGGTTTTTC TTCAAGCCAA GAAGGACACG GATTGGCTGA 840 

AGGAGAAAGT GCAGAGCTTG CAGACGCTGG CTGCCAACAA CTCTGCGTTG' GCCAAAGCCA 900 

ACAACGACAC CCTGGAGGAT ATGAACAGCC AGCTCAACTC ATTCACAGGT CAGATGGAGA 960 

ACATCACCAC TATCTCTCAA GCCAACGAGC AGAACCTGAA AGACCTGCAG GACTTACACA 102 0 

AAGATGCAGA GAATAGAACA GCCATCAAGT TCAACCAACT GGAGGAACGC TTCCAGCTCT 108 0 

TTGAGACGGA TATTGTGAAC ATCATTAGCA ATATCAGTTA CACAGCCCAC CACCTGCGGA 114 0 

CGCTGACCAG- CAATCTAAAT GAAGTCAGGA CCACTTGCAC AGATACCCTT ACCAAACACA 1200 

CAGATGATCT GACCTCCTTG AATAATACCC TGGCCAACAT CCGTTTGGAT TCTGTTTCTC 1260 

TCAGGATGCA ACAAGATTTG ATGAGGTCGA GGTTAGACAC TGAAGTAGCC AACTTATCAG 1320 

TGATTATGGA AGAAATGAAG CTAGTAGACT CCAAGCATGG TCAGCTCATC AAGAATTTTA 1380 
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CAATACTACA AGGTCCACCG GGCCCCAGGG GTCCAAGAGG TGACAGAGGA TCCCAGGGAC 1440 

CCCCTGGCCC AACTGGCAAC AAGGGACAGA AAGGAGAGAA GGGGGAGCCT GGACCACCTG 1500 

5 GCCCTGCGGG TGAGAGAGGC CCAATTGGAC CAGCTGGTCC CCCCGGAGAG CGTGGCGGCA 1560 

AAGGATCTAA AGGCTCCCAG GGCCCCAAJ^G GCTCCCGTGG TTCCCCTGGG AAGCCCGGCC lo2 0 

CTCAGGGCCC CAGTGGGGAC CCAGGCCCCC CGGGCCCACC AGGCAAAGAG GGACTCCCCG 158 0 

10 

GCCCTCAGGG CCCTCCTGGC TTCCAGGGAC TTCAGGGCAC CGTTGGGGAG CCTGGGGTGC 174 0 

CTGGACCTCG GGGACTGCCA GGCTTGCCTG GGGTACCAGG CATGCCAGGC CCCAAGGGCC 180 0 

15 CCCCCGGCCC TCCTGGCCCA TCAGGAGCGG TGGTGCCCCT GGCCCTGCAG AATGAGCCAA 186 0 

CCCCGGCACC GGAGGACAAT AGCTGCCCGC CTCACTGGAA GAACTTCACA GACAAATGCT 192 0 

ACTATTTTTC AGTTGAGAAA GAAATTTTTG AGGATGCAAA GCTTTTCTGT GAAGACAAGT 19 8 0 

20 

CTTCACATCT TGTTTTCATA AACACTAGAG AGGAACAGCA ATGGATAAAA AAACAGATGG 204 0 

TAGGGAGAGA GAGCCACTGG ATCGGCCTCA CAGACTCAGA GCGTGAAAAT GAATGGAAGT 210 0 

25 GGCTGGATGG GACATCTCCA GACTACAAAA ATTGGAAAGC- TGGACAGCCG GATAACTGGG 216 0 

GTCATGGCCA TGGGCCAGGA GAAGACTGTG CTGGGTTGAT TTATGCTGGG CAGTGGAACG 222 0 

ATTTCCAATG TGAAGACGTC AATAACTTCA TTTGCGAAAA AGACAGGGAG ACAGTACTGT 2 2 80 

30 

CATCTGCATT ATAACGGACT GTGATGGGAT CACATGAGCA AATTTTCAGC TCTCAAAGGC 2 340 

AAAGGACACT CCTTTCTAAT TGCATCACCT TCTCATCAGA TTGAAAAAAA AAAAGCACTG 2 4 00 

35 AAAGCCAATT ACTGAAAAAA AATTGACAGC TAGTGTTTTT TACCATCCGT CATTACCCAA 2460 

AGACTTGGGA ACTAAAATGT TCCCCAGGGT GATATGCTGA TTTTCATTGT GCACATGGAC 2 52 0 

TGAATCACAT AGATTCTCCT CCGTCAGTAA -CCGTGCGATT ATACAAATTA TGTCTTCCAA 2 580 

40 

AGTATGGAAC ACTCCAATCA GAAAAAGGTT ATCATTGGTC GTTGAGTTAT GGGAAGAACT 2 54 0 

TAAGCATATA CTGTGTAAAC AGTGCCATAC ATTTCTAAAA TCCCAAGTGT AGGAAAAATA 270 0 

45 TGCAGACATA CAGATATATA GGCCAACTAT TAGTAATAAT ATGAAATATA CTTAAAGAGC 2 76 0 

TTTTAAAACT TTGTATTTTT GTACAAAATA TTTGTCTTTT ACAATTTTTT TCCTTTTTTT 282 0 

TTTTTTGTCA TTTTACCGAC ATAATACATG GAGCCAAAGA AAACAATAAT GGTACTAATA 2 88 0 

50 

AAAACTCCTA GGGTTTCCTG TCAGATTTAA TTCTAAAAAA AAAAAAAAAA ' 293 0 
(2) INFORMATION FOR SEQ ID NO : 12 : 
55 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 208 amino acids 

(B) TYPE: eutiino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : L inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Lys Asp Asp Phe Ala Glu Glu Glu Giu Val Gin Ser Phe Gly Tyr 
15 10 15 

Lys Arg Phe Gly lie Gin Glu Gly Thr Gin Cys Thr Lys Cys Lys Asn 
20 25 30 

Asn Trp Ala Leu Lys Phe Ser lie lie Leu Leu Tyr lie Leu Cys Ala 
35 40 45 

Leu Leu Thr lie Thr Val Ala lie Leu Gly Tyr Lys Val Val Glu Lys 
50 55 60 

Met Asp. Asn Val Thr Gly Gly Met Glu Thr Ser Arg Gin Thr Tyr Asp 
65 70 75 80 

Asp Lys Leu Thr Ala Val Glu Ser Asp Leu Lys Lys Leu Gly Asp Gin 
85 90 95 

Thr Gly Lys Lys Ala lie Ser Thr Asn Ser Giu Leu Ser Thr Phe Arg 
100 105 110 

Ser Asp lie Leu Asp Leu Arg Gin Gin Leu Arg Glu lie Thr Glu Lys 
115 120 125 

Thr Ser Lys Asn Lys Asp Thr Leu Glu Lys Leu Gin Ala Ser Gly Asp 
130 135 140 

Ala Leu Val Asp Arg Gin Ser Gin Leu Lys Glu Thr Leu Glu Asn Asn 
145 150 155 160 

Ser Phe Leu lie Thr Thr Val Asn Lys Thr Leu Gin Ala Tyr Asn Gly 
165 170 175 

Tyr Val Thr Asn Leu Gin Gin Asp Thr Ser Val Leu Gin Gly Asn Leu 
' 160 185 190 

Gin Asn Gin Met Tyr Ser His 'Asn Val Val He Met Asn Ser Thr Thr 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 



89 



wo 98/55614 PCT/US9S/1 1210 



(A) LENGTH: 1589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
{ D ) TO POLOGY : 1 inear 

5 

(ii) MOLECULE TYPE: cDNA 



10 

(xi) SEQUENCE DESCRIPTION: 3EQ ID NO:13: 



TCTATATATT TTTTCTAGGA AGGGGTGTTT TTCTTTCTGA TTTAATTCCC TACATTTTTC 60 



15 TCTTTCATAT GAAGTTGCAG ATAATGTTTT TCCTTCGGAT TTTTATTCTT TAAGATTTTT 120 



AACCTGTGCA AGACTTTTTC AATGATACAA GTCAAGGAGG ATGAAGATCT TTTTCCACTT 180 



CAGTCTTCAC TTTGCTCCAG CTATTGCTAA GAAAGGCACA AACAATGACA GCATATTTAA 24 0 

20 

GGAAGAACCT GGCCGGCTTG GGTCACCGCT GCTGTCTTTC TTGGTTTTGC GTCTACCTGG 3 00 



GAGAGCCCAG CTTTTAGGTT CCCATTGAGG GAAGCATGAG AGAGGATTGT TTGGGGGATG 360 



2 5 . CTGCCAGAGC TTCCAGCTGA CAGTCTCTGC AGAGCGGCTG CCAAGTGGCC TGGTGGCCGT 42 0 



ATGTTGGCAG TTTTTGATGA ATTGGGATTA GGGAATGTTT GTTTACTTGA TAACCGAGTG 480 



TCTACAAGGA GAGGTGGCAG CGTGAGGGAA TAGTGCCACC ATAATGAGGA CACAGCCAGC 540 

30 

CATCTCTTCC CTGCCACAGA ACCCCAGGCA GTCCCCTTCA GGCTACAGTT TTCCATCTGG 600 



ACCGAGGGAC TGGCCGGTGC AGCAGGAGGA GCCGATCACC CTCTGTGGGA ACGAGGATGC 6 60 



3 5 CCAGAAGTTC CAGTTACTGT GGCTCCATGG TCCCCTTCTC GATGCGCATC TTGCACGCGG 7 20 



AGCTTCAGCA GTACCTGGGG AACCCACAGG AGTCGCTGGA TAGACTGCAC AAGGTGAAGA 7 80 

CTGTCTGCAG CAAGATCCTG . GCCAATTTGG AGCAAGGCTT AGCAGAAGAC GGCGGCATGA 840 

40 

GCAGCGTGAC TCAGGAGGGC AGACAAGCCT CTATCCGGCT GTGGAGGTCA CGTCTGGGCC 900 



GGGTGATGTA CTCCATGGCA AACTGTCTGC TCCTGATGAA GGATTATGTG CTGGCCGTGG 9 50 



45 AGGCGTATCA TTCGGTTATC AAGTATTACC CAGAGCAAGA GCCCCAGCTG CTCAGCGGCA 1020 



TCGGCCGGAT TTCCCTGCAG ATTGGAGACA TAAAAACAGC TGAAAAGTAT TTTCAAGACG 1080 



TTGAGAAAGT AACACAGAAA TTAGATGGAC TACAGGGTAA AATCATGGTT TTGATGAACA 1140 

50 _ 

GCGCGTTCCT TCACCTCGGG CAGAATAACT TTGCAGAAGC CCACAGGTTC TTCACAGAGA 1200 



TCTTAAGGAT GGATCCAAGA AACGCAGTGG CCAACAACAA CGCTGCCGTG TGTCTGCTCT 1260 
5 5 ACCTGGGCAA GCTCAAGGAC TCCCTGCGGC AGCTGGAGGC CATGGTCCAG CAGGACCCCA 132 0 
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GGCACTACCT GCACGAGAGC GTGCTCTTCA ACCTGACCAC CATGTACGAG CTGGAGTCCT 13 80 

CACGGAGCAT GCAGAAGAAA CAGGCCCTGC TGGAGGCTGT CGCCGGCAAG GAGGGGGACA 1440 

5 GCTTCAACAC ACAGTGCCTC AAGCTGGCCT AGCTGCCTCC AACACACTAC GTCAGAAGGA 15 00 

CCCGGGTCTT TGAAACTGTG TCTTGAAGCT AATGTATTAA TGTGACATGG AGGAACTCAA 15 60 

TAAAACTCCT GCTTCAAAAA AAAAAAAAA 1539 
(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : 1 inear 



20 



25 



40 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Pro Arg Ser Ser Ser Tyr' Cys Gly Ser Met Val Pro Phe Ser Met 
1 5 • 10 "15 



Arg lie Leu His Ala Glu Leu Gin Gin Tyr Leu Gly Asn Pro Gin Glu 
30 20 25 30 

Ser Leu Asp Arg Leu His Lys Val Lys Thr Val Cys Ser Lys lie Leu 
35 40 45 

3 5 Ala Asn Leu Glu Gin Gly Leu Ala Glu Asp Gly Gly Met Ser Ser Val 

50 55 50 



Thr Gin Glu Gly Arg Gin Ala Ser lie Arg Leu Trp Arg Ser Arg Leu 

65 70 75 80 

Gly Arg Val Met Tyr Ser Met Ala Asn Cys Leu Leu Leu Met Lys Asp 
85 90 95 



Tyr Val Leu Ala Val Glu Ala Tyr His Ser Val lie Lys Tyr Tyr Pro 
45 100 105 110 

, Glu Gin Glu Pro Gin Leu Leu Ser Gly lie Gly Arg lie Ser Leu Gin 
115 120 125 

50 lie Gly Asp lie Lys Thr Ala Glu Lys Tyr Phe Gin Asp Val Glu Lys 

130 135 140 



55 



Val Thr Gin Lys Leu Asp Gly Leu Gin Gly Lys lie Met Val Leu Met 
145 150 155 "160 
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Asn Ser Ala 



Phe Leu His Leu Gly Gin Asn Asn Phe Ala Glu Ala His 
165 170 175 



Arg Phe Phe 



Thr Glu lie Leu Arg Met Asp Pro Arg Asn Ala Val Ala 
180 185 190 



Asn Asn Asn 
195 



Ala Ala Val Cys Leu Leu Tyr Leu Gly Lys Leu Lys Asp 
200 205 



Ser Leu Arg 
210 



Gin Leu Glu Ala Met: Val Gin Gin Asp Pro Arg His Tyr 
215 220 



Leu His Glu 
225 



Ser Val Leu Phe Asn Leu Thr Thr Met Tyr Glu Leu Glu 
230 235 240 



Ser Ser Arg 



Ser Met Gin Lys Lys Gin Ala Leu Leu Glu Ala Val 
245 250 255 



Ala 



Gly Lys Glu 



Gly Asp Ser Phe Asn Thr Gin Cys Leu Lys Leu Ala 
260 265 270 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 1153 base pairs- 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TATAAAGAGT GACTCTCCTA TGAAGGTAAA GGCCACCCCT CTTCAGTTCC AGTGACTGAG 60 

ATACATTTTT CCAATCCTGG GGGCAAATAC AGACACAGCA AGTTCCTTCT TCCCTTTGGA 120 

AATTTGGCAG CTGCCTTCAC CAGTGAGCAC AAAGCCACAT TTCAAAGGAA ACTGACAAAT 180 
TATCCCCAGC TGCCAGAAGA AGAAATCCTC ACTGGACGGC TTCCTGTTTC CTGTGGTTCA " 240 

TTATCTGATT GGCTGCAGGG ATGAAAGTTT TTAAGTTCAT AGGACTGATG ATCCTCCTCA 3 00 

CCTCTGCGTT TTCAGCCGGT TCAGGACAAA GTCCAATGAC TGTGCTGTGC TCCATAGACT 3 60 

GGTTCATGGT CACAGTGCAC CCCTTCATGC TAAACAACGA TGTGTGTGTA CACTTTCATG 420 

AACTACACTT GGGCCTGGGT TGCCCCCCAA ACCATGTTCA GCCACACGCC TACCAGTTCA 480 

CCTACCGTGT TACTGAATGT GGCATCAGGG CCAAAGCTGT CTCTCAGGAC ATGGTTATCT 540 

ACAGCACTGA GATACACTAC TCTTCTAAGG GCACGCCATC TAAGTTTGTG ATCCCAGTGT 600 
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CATGTGCTGC CCCCCAAAAG TCCCCATGGC TCACCAAGCC CTGCTCCATG AGAGTAGCCA 660 

GCAAGAGCAG GGCCACAGCC CAGAAGGATG AGAAATGCTA CGAGGTGTTC AGCTTGTCAC 72 0 

5 AGTCCAGTCA AAGGCCCAAC TGCGATTGTC CACCTTGTGT CTTCAGTGAA GAAGAGCATA 7 80 

CCCAGGTCCC TTGTCACCAA GCAGGGGCTC AGGAGGCTCA ACCTCTGCAG CCATCTCACT 840 

TTCTTGATAT TTCTGAGGAT TGGTCTCTTC ACACAGATGA TATGATTGGG TCCATGTGAT 9 00 

10 

CCTCAGGTTT GGGGTCTCCT GAAGATGCTA TTTCTAGAAT TAGTATATAG TGTACAAATG 9 60 

TCTGACAAAT AAGTGCTCTT GTGACCCTCA TGTGAGCACT TTTGAGAAAG AGAAACCTAT 102 0 

15 AGCAACTTCA TGAATTA-^GC CTTTTTCTAT ATTTTTATAT TCATGTGTAA ACAAAAAATA 10 80 

AAATAAAATT CTGATCGCAT AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAPJV 1140 

AAAAAAAAAA AAA 1153 
(2) INFORMATION FOR SEQ ID NO: 16: 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 
2 5 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



30 



35 



50 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Lys Val Phe Lys Phe He Gly Leu Met He Leu Leu Thr Ser Ala 
15 10 15 



Phe Ser Ala Gly Ser Gly Gin Ser Pro Met Thr Val Leu Cys Ser He 
40 20 25 30 

Asp Trp Phe Met Val Thr Val His Pro Phe Met Leu Asn Asn Asp Val 
35 40 45 

45 Cys Val His Phe His Glu Leu His Leu Gly Leu Gly Cys Pro Pro Asn 

50 55 60 



His Val Gin Pro His Ala Tyr Gin Phe Thr Tyr Arg Val Thr Glu Cys 

65 70 75 80 

Gly He Arg Ala Lys Ala Val Ser Gin Asp Met Val He Tyr Ser Thr 

85 - 90 95 



Glu He His Tyr Ser Ser Lys Gly Thr Pro Ser Lys Phe Val He Pro 
55 100 105 110 
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Val 3er Cys Ala Ala Pro Gin Lys Ser Pro Trp Leu Thr Lys Pro Cys 
115 120 125 

Sar Met Arg Val Ala Ser Lys Ser Arg Ala Thr Ala Gin Lys Asp Glu 
5 130 135 140 

Lys Cys Tyr Glu Val Phe Ser Leu Ser Gin Ser Ser Gin Arg Pro Asn 
145 150 155 150 

10 Cys Asp Cys Pro Pro Cys Val Phe Ser Glu Glu Glu His Thr Gin Val 

165 170 175 



15 



Pro Cys His Gin Ala Gly Ala Gin Glu Ala Gin Pro Leu Gin Pro Ser 
laO 185 190 

His Phe Leu Asp lie Ser Glu Asp Trp Ser Leu His Thr Asp Asp Met 
195 200 205 



lie Gly Ser Met 
20 210 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH:- 42 8 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: cDNA 



3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 17 : 

TTTAATCTGT GTCTCCAGCA TTTATTTTTT TGTTTGTGTC ATCGGGTTCC TGGTTTTCTT 60 

TTAAGACATA GTCAACTGTG TGGACCTGTA GGTTTGGGGC AGCAACCAAT TCCATTGTTT 120 

40 

TCCTTTTTGT CAAATCCAAG AGAAAATATA CCATAAGGAG CTAGAAGATT CTAGTTCACA 180 

GCCTTTTGAA TCTTCATGGC CTTTGAATCC TCATGGCCTC TGAAATCTGA ATCAGTTTTC 240 

45 TCCCAGGARG TCTCTGGGGG CTGAGCTGCT ACAGGGGCAR ARGGTGGGGT GGGGTTGGGT 300 

GGGARAATCA TCCTGGCACT TCATCGTGCA TGCTATTTCG GGCAGCATCT -pTTTTTTTTT 3 60 

ATTTTATTAT TATTTTTTTT CCTGATGCTT GAGTTATGAA TGAGGATGAC CTCTGCAATC 420 

.50 

ATGATGTCTC CCATAGACTC TGTTCCTTGT TCCTTT'GCCA GCTTTCTCAT GCATGGTCCT 4 80 

AACACTTCCA TGATTTAATC TGCTGCAGGA CCATAGTCTT CAGCCACCTC AGCAATAACT 540 

55 TGTTAGAACA TTAAAAGGAA GTAAATTGAG AACAACTTGT TGCCATCCCA TTTTCATTAG 600 
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AAATCAGACA TCTTAGAGAT GTCAAGAAAG 
TCCTGCTCAC ACTGCTGTGT GTCAGACCAG 
5 GCCCTGATCT CTGGAACTCT TGCCACGTTG 
ACCATAACAA TCACAAATAG ACAGTAGTGG 
AGCAGAJ\AGG GAGCTGTGAG CCTTGCCCTG 

iO 

GTACTTCTCC CCTTCCTCTG AGCAGAGCTT 
AAGCAGCAAA AGAAGACAGC AGGTTCTGAC 
15 AGGCCCATTA ACCCCACCCC CCTCGCCTCC 
AGAATACTGA TGAGAAACCT AGTCTGGATT 
GACCAGTGCT CAAAATTGTG GTTATTTTTG 

20 

TTTCACCTTC CTCGCCCAGA TCCAGTTAAC 
AGTGACATGC TGGATTTTAA TTTTTATTGT 

2 5 GTTCCTCCCA ATTTAGCGCA CCATCCTGGG 

ACCTTCACCA CCTGTCGCAT CCTGCATCCT 
CTTAACTCAG CCCCAACTCC AGCTTGTGGC 

30 

GCCACACCAT GCACTCCACG GAGACTGAGC 
TCCACGACCA CCATGAGCAC ATCCCTGGGG 

3 5 TCTGCTGCCG TGTACGACCC CCAGAGCTGG 

TCCTACACGC CCAAGATGGC TGTGATCCCC 
CCCACATCCT CCCCACCCTC CTTTGAGTTC 

40 

CTGGCTTCCA AGCCAGCCAG CTCCATCCTG 
AGCGAGAGCC AGACCGACGT GTCCGTCTCC 

4 5 TTTGGGGTGG CCAAAGTGGT GAACTCAGGG 

CAGGGACCCC TCCTCTGTGG GCCCCCGGGG 
GTACCTGAGG GCCTGCCCCT CAGATGCCCC 

50 

CTCAATAGTG'GCATCCGGCG" GAATCGCAGC 
ATGAAAGCTC CTGTGACTCT CACCTCGGGC 
55 ACTAGCTTAC GGTGAGGACT GGAGGGGGGC 



CAGCTAGCAG CTAGGGGGTA TGGGGACCTG 6 60 

ACCTGATCCT GGAGCTCAGG ACCCTAGAGA 7 20 

TTGCTGAGGC AGCTGAAGTC CCCATCTCCC 7 80 

AGCCAGCATC CCCAGGCCCC TTTTTGTGTA 8 40 

TTTGCAGGTG TCAAGTGCCT CTCCCTGCCT 9 00 

TGGTAGCTGT TGCCAA.TGCA AAGA-AATGTA 9 60 

CTGAGGAGGG AAACCAAATT TATCCCACAA 102 0 

CACCCCCAGA CTGGATCCAC TACTGGCCCA 10 30 

GGGTCGGAAG CTGGAATTTG GTGCTCTGCA 1140 

AGGACTCGCC TTCAATCCAG AACATTTGCG 1200 

AAGGTAGCTC ATCACTTCTT GCATCTGTTG 12 60 

GGTTGTACTT GGATGCAAGG AATATGTTTT 1320 

AAGTGCATGT CTCAGACCAA CTCCACCTTC 13 80 

TCAGATGAGC TCACTCGGGT CACACCAAGC 1440 

AGCACCAGCC ACTTGAAATC CACGCCGGTG 1500 

CTGGCTGAGT CCTTCACTAA CACCCGTGAG 1560 

CTCGTGTGGC TGTTGAAGGA GCGGGGCATT 1620 

GACAGGGCCG GCCGGGGCTC CCTCCTGCAC 168 0 

TCTACTCCGC CGAACTCGCC TATGCAGACA 17 4 0 

AAGTGCACGA GCCCTCCCTA CGACAATTTC 1800 

AGGGAAGTGA GAGAAAAGAA CGTCCGCAGC 18 60 

AACCTCAACC TCGTGGACAA AGTCAGGAGG 192 0 

CGAGCCCATG TCCCCACCTT GACTGAGGAG 19 80 

CCAGCACCAG CCCTTGTTCC CAGAGGCCTG 2040 

ACTGTGACCA ' GTGCCATCGG TGGGCTGCAG 2100 

TTCCCCACCA TGGTGGGATC TAGCATGCAG 2160 

ATCTTGATGG GTGCTAAGCT CTCCAAACAA 2 220 

CGGTTGCCCT AGAGGAGACC CACGTTCTCT 2280 

95 
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CTTGCTCCCA CCTCCCTCTC TTCCCCCCAC 
CCACCCCCTC CTAAGCTAGA CAAATCAACC 
5 TGTAAAATGT GTACATAGGA CTTGGAGACC 
CACAGGAAGT GCCCCTGCAC TGTCATCACT 
GGGAAGGTGG GGTCCTTTCT TCTTTCCTTT 

10 

TTATCCCATG GATAGGAAAC CAGTGAATTC 
GCACGGGTCA TAACACA.TCT GGGTGTCATC 
15 CGTAAGGAGC CTCCATCCTC AGCCACGTGC 
TTTCTTTTAT GGGTGGCCCA GCTTCTTCAA 
CGTTTCTTTT TTGAGGTGTG ACCTTTTGTT 

20 

TTTTGTAATC AGCTGTCAGG CCAAATGTCT 
CTGCGTTGTT CAGCAGCCCC TCTGTGTTCT 
25 ACATATATAT GCAGGGAAGT AATGGTACTG 
TGAATTTCGA ACACACCAAG CCGCTAATGA 
ACAACCAAAT TGATTTAAGA CCGGACCCAA 

30 

AAGGATAGGG AAAAAGCTTA TTAAAGAAAT 
CACAACCAGG CATAATACCA AACCGGTTCC 

3 5 GCTCCAGCGC TTTTTCTGAA ACCCGAGGCT 

ATTGCAGCCA TTCAATGCCC ACATGCTTTT 
AACAAGATAT TTTCTTTCCC TCCAAAGCCT 

40 

GGAA7VAGATC CAGGTGCTTG TGAAAAGAAT 
TGCTGTCGCC GATTAAGTTT TAAACTTTTA 

4 5 AAACATTCTC GTTCCTTCCA GTTCCAGTCA 

TGTGACTTAT TTATTCTAAT TTGAGGGCTG 
TGTTCCAGAC ATTTATGGAA GGAAAACATC 

50 

TCCCCGGCAG CAGAAGATGT GTCCTTCCAT 
ATACTTTGAG AAAGCCCCTA AGGAACAAGC 
55 TCTGAACACA AGAGTATTGG TTAATTATGT 



AGTGCACTCC CTCCCTCTGC CCTTCTCTGT 2 3 40 

TTGTGCCTAA TGGAGGAAGT GTGGAAACTT 2 400 

TTGTGTCCGC CCTGCTCTTT CTTCCGATCC 2 46 0 

CTCACGAGGA CGTCACCTGT GCTAACCTGG 2 52 0 

TGAGAAGCAC TGAAACTCCC AAGTGTGTTC 2 58 0 

CGTGGCTGGC ACACCACGAG CTGTCATGCG 2 64 0 

GGACACCTCA CCTCGCCCAC CCTGTAGGAG 270 0 

AGCTGACGTG GCTTTCCTGA TCGGAGGGCT 27 6 0 

GACCTTCACT GCTCTGCCTC AGTGGACAGT 2 82 0 

TTCATGCCTT CCCCTTGAAG TCATCCTGTG 2 880 

GACCCGAAAG AGAA-TGTATT TACACTCATG 2 940 

GTGTGATTTG TTTTATTTTT CCTTTTTTTT 3 000 

GTAGTGTATG TTTTCTATGT GGTTCAAATA 3 0 60 

GATAGCAGCT TTTTTCTGGG ACCCAGAGTC 312 0 

GACACCTTTA ACAATAGGAC TGAAAGGAAA 3180 

GTGTCAACAC CAAATGTAGA GGGGAAGAAC 3 2 40 

AGGGGGAAAC AAGGCTTTGG TATTCCGCTG 3 300 

GGCCAGGGTG CTGTCACCGT GTGGTCTTTG 3 3 60 

CCTTCTTGTT TCAGAACAGC ACATGGTCAC 3 420 

TTTGTCTCCT TGTGCCTCTT TTTATCCTTA 3 48 0 

CATGAATGCA ACAAGGGAGG CTGGTCCTGT 3 54 0 

TTTATTATTT ATGTCTGCCG TATTTTAAAT 3 600 

TAGTGTGTCT GTGGCATTCC AGTCCAACCA 3 66 0 

CACTGTACAC CATGGTGTCC TGTGACACCG 372 0 

CCATATAAAT GAAACTGTCA TGCTGTGTCC 378 0 
TGAGTGAGGG TAACCTTATG TCCACCAAGG ' 3 840 

CTCAGTCCCA CGGTTTCAGA CTATTTATTC 3 9 00 

TCTCAGCTCT CCCTGCTGTT GTATGTGTGC 3 9 60 

96 
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ATTCACTGCA AGTAACTTAT ATCTTTTTAT TTGAATGTAT TTTAAAGCAG TAGATAGAAT 4 02 0 

AACAAAGGAA TATGAAAACC ATGGACTGAA TGGACCATTT TATGTATTCA GAGAGAGAAG 4080 

5 CCACTCATCA TTGCCAGAAA TACCATGTAA A-OlATTGGCAG TTCAGAGGTT GCAATACTTA 414 0 

GTATAGTA.AA TAAATAAACG GTCAACATTG TGCAACCACT ACCAAAAAGT GTGTTGTAAT 420 0 

GCATCAAAAA TCAACACAAT TTTATTCACT AATGAGTATC AATAAA-ATAA GTTCAAATGA 42 6 0 

10 

TGGAAACCAC AAAAAAA-QlAA AAAAA 42 8 5 

(2) INFORMATION FOR SEQ ID NO -.18: 

15 (i) SEQUENCE CHARACTERISTICS: 

{A} LENGTH: 429 amino acids 

(B) TYPE: amino acid ^ 

(C) STRANDEDNES3: 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: protein 



(x£) SEQUENCE DESCRIPTION: SEQ ID NO : 1 8 : 

Met Gin Arg Asn Val Lys Gin Gin Lys Lys Thr Ala Gly Ser Asp Leu 
15 10 15 

30 

Arg Arg Glu Thr Lys Phe lie Pro Gin Arg Pro lie Asn Pro Thr Pro 
20 25 30 

Leu Ala Ser His Pro Gin Thr Gly Ser Thr Thr Gly Pro Arg lie Leu 
35 35 40 45 

Met Arg Asn Leu Val Trp lie Gly Ser Glu Ala Gly lie Trp Cys Ser 
50 55 SO 



4 0 Ala Asp Gin Cys Ser Lys Leu 

65 70 

Ser Arg Thr Phe Ala Phe His 
85 

45 

Val Ala His His Phe Leu His 
100 



Trp Leu Phe Leu Arg Thr Arg Leu Gin 
75 80 

Leu Pro Arg Pro Asp Pro Val Asn Lys 
90 95 

Leu Leu Ser Asp Met Leu Asp Phe Asn 
105 110 



Phe Tyr Cys Gly Cys Thr Trp Met Gin Gly lie Cys Phe Val Pro Pro 
50 ' 115 120 , _ 125 

Asn Leu Ala His His Pro Gly Lys Cys Met Ser Gin Thr Asn Ser Thr 
130 135 140 

55 Phe Thr Phe Thr Thr Cys Arg lie Leu His Pro Ser Asp Glu Leu Thr 
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145 150 155 160 

Arg Val Thr Pro Ser Leu Asn Ser Ala Pro Thr Pro Ala Cys Gly Ser 
165 170 175 

5 

Thr Ser His Leu Lys Ser Thr Pro Val Ala Thr Pro Cys Thr Pro Arg 
130 185 190 

Arg Leu Ser Leu Ala Glu Ser Phe Thr Asn Thr Arg Glu Ser Thr Thr 
10 195 200 205 

Thr Mec Ser Thr Ser Leu Gly Leu Val Trp Leu Leu Lys Glu Arg Gly' 
210 215 220 

15 lie Ser Ala Ala Val Tyr Asp Pro Gin Ser Trp Asp Arg Ala Gly Arg 

225 230 235 240 



20 



50 



55 



Gly Ser Leu Leu His Ser Tyr Thr Pro Lys Met Ala Val lie Pro Ser 

245 250 255 

Thr Pro Pro Asn Ser Pro Met Gin Thr Pro Thr" Ser Ser Pro Pro Ser 

260 265 270 



Phe Glu Phe Lys Cys Thr Ser Pro Pro Tyr Asp Asn Phe Leu Ala Ser 
25 275- - 280 2B5 

Lys Pro Ala Ser Ser lie Leu Arg Glu Val Arg Glu Lys Asn Val Arg 
290 295 300 

3 0 Ser Ser Glu Ser Gin Thr Asp Val Ser Val Ser Asn Leu Asn Leu Val 

305 310 315 320 

Asp Lys Val Arg Arg Phe Gly Val Ala Lys Val Val Asn Ser Gly Arg 
325 330 335 

35 

; Ala His Val Pro Thr Leu Thr Glu Glu Gin Gly Pro Leu Leu Cys Gly 

340 345 350 

Pro Pro Gly Pro Ala Pro Ala Leu Val Pro Arg Gly Leu Val Pro Glu 
40 355 360 365 

Gly Leu Pro Leu Arg Cys Pro Thr Val Thr Ser Ala lie Gly Gly Leu 
370 375 _ 330 

4 5 Gin Leu Asn Ser Gly lie Arg Arg Asn Arg Ser Phe Pro Thr Met Val 

385 390 395 400 



Gly Ser Ser Met Gin Met Lys. Ala Pro Val Thr Leu Thr Ser Gly lie 
405 - 410 415 

Leu Met Gly Ala Lys Leu Ser Lys Gin Thr Ser Leu Arg 
420 425 

(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 751 base pairs 
(3) TYPE: nucleic acid 
(C) STRAND EDNES S : double 
5 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19; 



ACTTTGAATT TTTTATTTGT GAAATTAAAA ATATGGTATT ATATATATAT AAACTTCTAT 60 

15 

TCCTCTATAA ATATAGATGA TTTTGTGATA GTGAACAGAA TAAATGTATA CCAAATTCAA 12 C 



AGACCAATAT CATTTTAGCG TATGACAGAC ATAGATAAAT TTAGGTCCTA AGTACCGGCA 180 



20 TTTTGAT.AAA TTCTTAAAGT TTAAAAC-^VAT ACAATCAGGA GGATTGCTTT TCTCCTCTTC 240 



TTCACAGAGA ACTAAAGTGA ATATTTTTAA ATGGCTTTGA AAGATTTACA TTTGACACAT 300 



TTCTGTAAAT CCAAAAGAGG AGCACACAGG GATTTAATGC AGTAGACCTG CACACATTTT ■ 3 60 

25 - - 

. CCCTTTAGCA TGCATGCCCA TATTTTGTTT ATTTCAGGCG CTATCTCCCC GTCAATTATT ■ 42 0 



CCACCTTCTT TACCTCCTGA AATCTTACCA GGTTATTATT GGTGGTGTGA ATTGTTCCCC 48 0 



30 CCTCAGAATG TGCTGCTGAA TAATAATCGT AATAAAATGT TGAAAGTGTA CAACTTTTAC 540 



ATTTTAAAGT TTCTGATATA TGTCTAGTTA TTTGATTAAA AATAAGAAAA TAGCACTTCA 600 



TTTTGAGGAA GTCCATGACA CTGAAATATC CTTCAAGTTT TCAATTTCTG TTTACGTTTT 660 

35 

GCTGTCTTGT TAAGGAAAGC AAACATCAAC TCCTT.QJVCAA AGCTTTCCAG GTGACCTCAA 72 0 



CATTTCCATT TTACAGACCG GTAAAATCTA AGCGCAGGCT GTCTCATTCT CAAAGGCAAG 78 0 



4 0 GTTGCCAGGC ATCCGTATGC AATTAGAATT AACATTTTAT AACCCATATC TTCAGTCTCT 840 



TCCAACCCAC ACAAAGCTTC ATGCTTCTTC CCAAATCTCA GTAACCACAT CTTTCCATGA 9 00 

CGCTGGCCAA ACCCATACCA GGTTTTAGAC ACTAGAGAAT GAAATGAGCT " CACCCCTCAA 9 60 

45 

AAATTAGACT TCAAAAAGTT TGGCATTGGT TATCTCACTC ACCCTGTAAC CAACTAAGGT 1020 



GGGAGAAGGG AGTGTCTGGC GTTGAAGGTG ACCGTGGAGG GAGGCTGAGA CTGCCAGCGC 10 80 



50 CCACACCCGT GGGCCCCCAT GAAGTTGGAG GAAAGTTCTG GACAGTTAAA AATCCAGCTT 1140 



CAGGAAGTCG AAGGGACGGG CCTTCGCAAT CCACCGCCGA GCAAGGGAGG AATTGTAATG 12 00 



TATGGGGGCC CTCCTCCAGA TTTGGAAGGT TTGTGGAGTT CTGTACCTTA AGAGCCCCTA 12 60- 

55 
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CCTCAAGCCA GGAAAGAAAG GGAGGGGACA 
CGGGAAGTGA CCCTGGCAGC GCAGCCCTAG 
5 AGCTGGCGCC CAGCCGTCCC GAGCACCCGT 
AGCGCTGTGC GGTTGCACAC CAGTTTTAGC 
GAAAAGCCTC AAATGGTTAA ACCGCCCTAA 

10 

CGTGATCGGT CGTCATTTAA ATACAAATAT 
A^TCATAAAA GCGAACAGTC CTGGTACCAG 
15- CCTCTX3GCAG TCGGGCCCTC GTGTCCTTTT 
GACTTGTTCT AGGCCTCTTT GGCCGAATTC 
ATTTCATTCA GAGAATATTT ATGAAATGCC 

20 

GGCCACTTCT CAGTGAGGGA GAGATGTAGT 
AAGCAGTAAA TAGCATGTCT TTCCCCTCCC 
25 CCCTAATGGA ATTCATGAGT TTGGGGGTCT 
TGATGATTCT GGATTTTTGC TGTTTGTTAT 
GTGGGTGGCC ATTTCCTGAC ATCTGCATGT 

30 

CAAATCAGAA AGCTGACCCC AAGACTGCAA 
GTAGCGTAGA CAACAGCAGT CATAAATAAT 
3 5 TTGACCACCT CTACCACCAG AACCCAGCAG 
ACTCGATGTT TTTGTTTTGC ATTTTCTCCT 
GTCTGTAGAG TCAGCACAGC CCCATCAGTC 

40 

TCCCATTAAC TGACATGGTC AGATTTCCAG 
CTCTCCYTGT GAGAGGAAAY TGCGCGSGAA 

45 AGCCAACATG CCTCGGTATG CGCAGCTGKT 
YTACTGTGCC ACCATGGTCC AGCACTGTGA 
CCTGGATCCA GCAGCAGAAC ACTTCAAYTA 
CGAGGTGGAT GATGTAATGG AGGATGATTY 
ATTTTGCATG GAGTACTTTG CCAATAATTT 

55 AGAGGACGAC TATATCCTTT TTGATTGTCC 



GAAGGAGGGG GAGGGGGCAA AAGGAGGAGG 132 0 

TCGCACCCCG CAGTGCTGAJii CTCGCCCCGG 13 80 

GGTAGGGAGA GGCGCGC GAG GACGACCAGG 1440 

TCCTTTGCAA TACTCCGAAA AGGGCAAGAA 150 0 

ATAATTAAAA ACTTTTGAAA AAGAAAAACG 15 6 0 

ACTTACAAAA ATCCTACACA GGCTATTTAC 16 2 0 

AGTGTGAGGG CAAGAGGTCT GTCCATCCTC 16 8 0 

GCCTCAGGGA CGGAAGCTTT TGCAGGAGCT 17 40 

GGCCAAAGAG GCCTAATTCC TTCCTCGGTT 18 00 

TACTGTGTGC AAGTCATCCA TCCTTGAAAA 18 60 

GGATTCTGTG AGACATACCT GCTGGAGTTG 19 20 

CGATCTTAAG GTGTGTTTTC TAGAAAAGTT 19 8 0 

CAGTCACCCG CTTGCCTGTA GGATTCCATT 2 040 

TGCCCTTAGA GGGGCTCTGA GTATCTACTT 2100 

ACCTCGTGGA ATTCAGCCAG CTTCATGTTG 2160 

ATCAATGAAG GTATTGGCAT TGTTAAGGTC 2 22 0 

TAGGCAGGAA CTTAACCCAA ATCTAGTTCT 22 8 0 

ACACTCACAT CTCCTGATAA GAGTTGCTGG 23 40 

CTCCTTCCCC ACTTACTCAG AGAATTTAAA 2 40 0 

CAGGAACTTC CCACCACCAG CCCTTGACTG 2 46 0 

CTCCCCCTAC TCCCTGCTGT GAAACAATCC 252 0 

GGYTAAGGGA GTGTGGCGGG CGGYTCCGGG 258 0 

CATGG5CCCC GCGGGCAGCG GGAAGAGCAC 2 64 0 

AGCCYTCAAC CGGTCTGTCC AAGTTGTAAA 27 0 0 

CTCCGTGATG GCTGACATCC GGGAACTGAT 27 6 0 

TYTGCGATTC GGTCCCAACG GAGGATTGGT 2 82 0 

TGACTGGCTG GAGAACTGTC TTGGCCATGT 2880 

AGGTCAGATT GAGTTGTACA CTCACCTGCC 29 40 

100 
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TGTGATGAAA CAGCTGGTCC AGCAGCTCGA GCAGTGGGAG TTCCGAGTCT GTGGAKTTTY 3 000 

TYTTGTTGAT TCTCAGTTCA TGGTGGAGTC ATTCAAGTTT ATTTCTGGCA TCTTGGCAGC 3 060 

5 CCTGAGTGCC ATGATCTCTC TAGAAATTCC GCAAGTCAAC ATCATGACAA AAATGGATCT 3120 

GCTGAGTAAA AAAGCAAAAA AGGAAATTGA GAAATTTTTA GATCCAGACA TGTATTCTTT 3180 

ATTAGAAGAT TCTACAAGTG ACTTAAGAAG CAAAAAATTC AAGAAACTGA CTAAAGCTAT 3 24 0 

10 

ATGTGGACTG ATTGATGACT ACAGCATGGT TCGATTTTTA CCTTACGATC AGTCAGATGA 3 3 00 

AGAAAGCATG AACATTGTAT TGCAGCATAT TGATTTTGCC ATTCAATATG GAGAAGACCT 3 36 0 

15 AGAATTTAAA GAACCAAAGG AACGTGAAGA TGAGTCTTCC TCTATGTTTG ACGAATATTT 3 42 0 

TCAAGAATGC CAGGATGAAT GAAGAGTTTA CTAAAAGTAA CCATCTAAAG AGCTTGTGGC 3 48 0 

CAAACCAGCA GAACATTCTT CTYTTCAAAG GATGCAATAG TAGAAAGCTA CTTATTTTAA 3 540 

20 

TGAAAAAAAG TAAAACTTCG TTCTTTATCA GCCTCATGCC TGAATCAAAT TTTTAATTAT 3 60 0 

TCTGAAACTG CTGCTGTTTA AAGTGGAATC TTTTAGTATT ATAACAGCAT CACTTTAGAT 3660 

2 5 . TTTGTAAGTC "AAAATTGAAA TGAATGCACA . TAGATTTATA TATAAATTAG CACCTGAGCT 372 0. 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 3751 
(2) INFORMATION FOR SEQ ID NO: 20: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 
■ (B-) TYPE: amino acid 
( C ) STRA^EDNESS : 

3 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



Met Pro Arg Tyr Ala Gin Leu Xaa Met Xaa Pro Ala Gly Ser Gly Lys 

45 1 5 . 10 15 

. Ser Thr Tyr Cys Ala Thr Met Val Gin His Cys Glu Ala Xaa Asn Arg 

20 25 30 

50 Ser Val Gin Val Val Asn Leu Asp Pro Ala Ala Glu His Phe Asn Tyr 

35' 40 45 



Ser Val Met Ala Asp lie Arg Glu Leu lie Glu Val Asp Asp Val Met 
50 - 55 60 



55 
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Glu Asp Asp Xaa Leu Arg Phe Gly Pro Asn Gly Gly Leu Val Phe Cys 
65 70 75 80 

Met Glu Tyr Phe Ala Asn Asn Phe Asp Trp Leu Glu Asn Cys Leu Gly 
85 90 95 

Kis Val Giu Asp Asp Tyr lie Leu Phe Asp Cys Pro Gly Gin He Glu 
100 105 110 

Leu Tyr Thr His Leu Pro Val Met Lys Gin Leu Val Gin Gin Leu Glu 
115 120 125 

Gin Trp Glu Phe Arg Val Cys Gly Xaa Xaa Xaa Val Asp Ser Gin Phe 
130 135 140 

Met Val Glu Ser Phe Lys Phe He Ser Gly lie Leu Ala Ala Leu Ser 
145 150 155 160 

Ala Met He Ser Leu Glu He Pro Gin Val Asn He Met Thr Lys Met 
165 170 175 

A.SP Leu Leu Ser Lys Lys Ala Lys Lys Glu He Glu Lys Phe Leu Asp 
180 185 190 

Pro Asp Met Tyr Ser Leu Leu Glu Asp Ser Thr Ser Asp Leu Arg Ser 
195 200 205 

Lys Lys Phe Lys Lys Leu Thr Lys Ala He Cys Gly Leu He Asp Asp 
210 215 220 

Tyr Ser Met Val Arg Phe Leu Pro Tyr Asp Gin Ser Asp Glu Glu Ser 
225 230 235 240 

Met Asn He Val Leu Gin His He Asp Phe Ala He Gin Tyr Gly Glu 
245 250 255 

Asp Leu Glu Phe Lys Glu Pro Lys Glu Arg Glu Asp Glu Ser Ser Ser 
250 265 270 

Met Phe Asp Glu Tyr Phe Gin Glu Cys Gin Asp Glu 
275 280 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TNCAGGCCTT GCGTTCCTAG CTGCTCTGC 2 9 

5 (2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



20 



30 



35 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GNGCTGTGAG TTTATCCACA AAGGAACAG - 29 

(2) INFORMATION FOR SEQ ID NO:23: 

25 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2'9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:23: 
GNATAGGAGG TCCCAAGTTA TCAAGGTTT 2 9 

(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
50 (A) DESCRIPTION: /desc = "oligonucleotide" 



55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
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10 



15 



20 



GNTTTCCTGG TTCTTGGTCA GGTTTCCTC 2 9 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ocher nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



, (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CNAGATGCAA TGGTTGTGAG ATTGACCAA 29 
(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
25 (B) TYPE: nucleic. acid 

(C) STRANDEDNESS.: single 

(D) topology; Linear 

(ii) MOLECULE TYPE: other nucleic acid 
3 0 (A) DESCRIPTION: /desc = "oligonucleotide' 



3 5 Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 6 : 

GNCACTTTCC ACTGCTGTGA GCTTGTCAT , 29 
(2) INFORMATION FOR SEQ ID NO: 27: ' 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
55 ANCAGACAGT TTGCCATGGA GTACATCAC 29 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 base pairs 
5 (B) TYPE: nucleic acid 

(C) STP-ANDEDNESS : single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
10 (A) DESCRIPTION: /desc = "oligonucleocide" 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 8 : 

TNATG.IUVCCA CAGGAAACAG GAAGCCGTC 29 
(2) INFORMATION FOR SEQ ID NO: 29: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single . 

2 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

3 5 TNAAGGTGAA GGTGGAGTTG GTCTGAGAC 29 

(2) INFORt4ATION FOR SEQ ID NO : 3 0 : 

( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

4 5 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

50 ■ . - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

GNCAGAAATA AACTTGAATG ACTCCACCA 29 

55 (2) INFORMATION FOR SEQ ID NO: 31; 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 57 amino acids 
(5) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 

Met Asn Ser Gin Leu Asn Ser Phe Tlor Gly Gin Met Glu Asn He Thr 
15 10 15 

Thx He Ser Gin Ala Asn Glu Gin Asn Leu Lys Asp Leu Gin Asp Leu 
20 25 30 

His Lys Asp Ala Glu Asn Arg Thr Ala He Lys Phe Asn Gin Leu Glu 
35 40 " 45 

Glu Arg Phe Gin Leu Phe Glu Thr Asp He Val Asn He He Ser Asn 
50 55 / 60 

He Ser Tyr Thr Ala His His Leu Arg Thr Leu Thr Ser Asn Leu Asn 
65 70 75 80 

Glu Val Arg Thr Thr Cys Thr Asp Thr Leu Thr Lys His Thr Asp Asp 
35 90 95 

Leu Thr Ser Leu Asn Asn Thr Leu Ala Asn He Arg Leu Asp Ser Val 
100 105 110 

Ser Leu Arg Met Gin Gin Asp Leu Met Arg Ser Arg Leu Asp Thr Glu 
115 120 125 

Val Ala Asn Leu Ser Val He Met Glu Glu Met Lys Leu Val Asp Ser 
130 135 140 

Lys His Gly Gin Leu He Lys Asn Phe Thr He Leu Gin Gly Pro Pro 
145 150 155 160 

Gly Pro Arg Gly Pro Arg Gly Asp Arg Gly Ser Gin Gly Pro Pro Gly 
165 170 175 

Pro Thr Gly Asn Lys Gly Gin Lys Gly Glu Lys Gly Glu Pro Gly Pro 
180 185 190 

Pro Gly Pro Ala Gly Glu Arg Gly Pro He Gly Pro Ala <5ly Pro Pro 
195 200 205 

Gly Glu Arg Gly. Gly Lys Gly Ser Lys Gly Ser Gin Gly Pro Lys Gly 
210 215 220 
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Ser Arg Gly Ser Pro Giy Lys Pro Giy Pro Gin Giy Pro Ser Gly Asp 
225 230 235 240 

Pro Gly Pro Pro Gly Pro Pro Gly Lys Glu Gly Leu Pro Gly Pro Gin 
245 250 255 

Gly Pro Pro Gly Phe Gin Gly Leu Gin Gly Thr Val Gly Glu Pro Gly 
260 265 2-70 

Val Pro Gly Pro Arg Gly Leu Pro Gly Leu Pro Gly Val Pro Gly Met 
275 280 285 

Pro Gly Pro Lys Gly Pro Pro Gly Pro Pro Gly Pro Ser Gly Ala Val 
290 295 300 

Val Pro Leu Ala Leu Gin Asn Glu Pro Thr Pro Ala Pro Glu Asp Asn 
305 310 315 320 

Ser Cys Pro Pro His Trp Lys Asn Phe Thr Asp Lys Cys Tyr Tyr Phe 
325 330 335 

Ser Val Glu Lys Glu He Phe Glu Asp Ala Lys Leu Phe Cys Glu Asp 
340 345 350 

Lys Ser Ser His Leu Val Phe He Asn Thr Arg Glu Glu Gin Gin Trp 
355 360 3 65 

He Lys Lys Gin Met Val Gly Arg Glu Ser His Trp He Gly Leu Thr 
370 375 380 

Asp Ser Glu Arg Glu Asn Glu Trp Lys Trp Leu Asp Gly Thr Ser Pro 
385 390 395 400 

Asp Tyr Lys Asn Trp Lys Ala Gly Gin Pro Asp Asn Trp Gly His Gly 
405 410 415 

His Gly Pro Gly Glu Asp Cys Ala Gly Leu He Tyr Ala Gly Gin Trp 
420 425 430 

Asn Asp Phe Gin Cys Glu Asp Val Asn Asn Phe He Cys Glu Lys Asp 
435 440 445 



Arg Glu Thr Val Leu Ser Ser Ala . Leu 
450 455 

t2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 542 amino acids 
{B) TYPE: amino acid 

(C ) STRANDEDNESS : 

(D) - TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein" 
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10 



25 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Cys Gly His His Glu Leu Asn Asn Leu Asn. Leu Thr Gin Val Gin Gin 
i 5 IC 15 

Arg Asn Leu He Thr Asn Leu Gin Arg Ser Vai Asp Asp Thr Ser Gin 
20 25 30 

Ala He Gin Arg lie Lys Asn Asp ?he Gin Asn Leu Gin Gin Val Phe 
35 40 45 



Leu Gin Aia Lys Lys Asp Thr Asp Trp Leu Lys Glu Lys Val Gin Ser 
15 50 55 60 

Leu Gin Thr Leu Ala Aia Asn Asn Ser Ala Leu Ala Lys Ala Asn Asn 
65 70 75 3G 

2 0 Asp Thr Leu Glu Asp Met Asn Ser Gin Leu Asn Ser Phe Thr Gly Gin 

85 90 , 95 

Mec Glu Asn He Thr Thr lie Ser Gin Ala Asn Glu Gin Asn Leu Lys 
100 105 110 



Asp Leu Gin Asp Leu His ' Lys Asp Ala Glu Asn Arg Thr Ala He Lys 
115 120 125 



Phe Asn Gin Leu Glu Glu Arg Phe Gin Leu Phe Glu Thr Asp He Val 
30 130 125 140 

Asn He He Ser Asn He Ser Tyr Thr Ala His His Leu Arg Thr Leu 
145 150 155 160 

3 5 Thr Ser Asn Leu Asn Glu Val Arg Thr Thr Cys Thr Asp Thr Leu Thr 

165 170 175 

Lys His Thr Asp Asp Leu Thr Ser Leu Asn Asn Thr Leu Ala Asn He 
180 • 185 190 



Arg Leu Asp Ser Val Ser Leu Arg Met Gin Gin Asp Leu Met Arg Ser 
195 200 205 



Arg Leu Asp Thr Glu Val Ala Asn Leu Ser Val He Met Glu Glu Met 
45 210 215 220 

Lys Leu Val Asp Ser Lys His Gly Gin Leu He Lys Asn Phe Thr He 
225 230 235 240 

5 0 Leu Gin Gly Pro Pro Gly Pro Arg Gly Pro Arg Gly Asp Arg Gly Ser 

245 250 ' 255 

Gin Gly Pro Pro Gly Pro Thr Gly Asn Lys Gly Gin Lys Gly Glu Lys 
260 265 270 

55 
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Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Glu Arg Gly Pro He Gly 
275 280 235 

Pro Ala Gly Pro Pro Gly Glu Arg Gly Gly Lys Gly Ser Lys Gly Ser 
5 290 295 300 

Gin Gly Pro Lys Gly Ser Arg Gly Ser Pro Gly Lys Pro Gly Pro Gin 
305 310 315 320 

10 Gly Pro Ser Gly Asp Pro Gly Pro Pro Gly Pro Pro Gly Lys Glu Gly 

325 330 335 



15 



30 



45 



Leu Pro Gly Pro Gin Gly Pro Pro Gly Phe Gin Gly Leu Gin Gly Thr 
340 345 350 

Val Gly Glu Pro Gly Val Pro Gly Pro Arg Gly Leu Pro Gly Leu Pro 
355 360 365 



Gly Val Pro Gly Met Pro Gly Pro Lys Gly Pro Pro Gly Pro Pro Gly 
20 370 375 380 

Pro Ser Gly Ala Val Val Pro Leu Ala Leu Gin Asn Glu Pro Thr Pro 
385 390 395 400 

2 5 Ala Pro Glu Asp. Asn Ser Cys Pro Pro His Trp Lys Asn Phe Thr Asp 

405 410 415 

Lys Cys Tyr Tyr Phe Ser Val Glu Lys Glu He Phe Glu Asp Ala Lys 
420 425 430 



Leu Phe Cys Glu Asp Lys Ser Ser His Leu Val Phe He Asn Thr Arg 
435 440 443 



Glu Glu Gin Gin Trp He Lys Lys Gin Met Val Gly Arg Glu Ser His 

35 450 455 460 

Trp He Gly Leu Thr Asp Ser Glu Arg Glu Asn Glu Trp Lys Trp Leu 

465 470 475 480 

4 0 Asp Gly Thr Ser Pro Asp Tyr Lys Asn Trp Lys Ala Gly Gin Pro Asp 

485 490 495 



Asn Trp Gly His Gly His Gly Pro Gly Glu Asp Cys Ala Gly Leu He 
500 505 - 510 

Tyr Ala Gly Gin Trp Asn Asp Phe Gin Cys Glu Asp Val Asn Asn Phe 
515 520 525 



He Cys Glu Lys Asp Arg Glu Thr Val Leu. Ser Ser Ala Leu 
50 530 535 540 
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What is claimed is: 

1. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 266 to nucleotide 1651; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 521 to nucleotide 1651; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 335 to nucleotide 634; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone as294_3 deposited under accession 
number ATCC 98444; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone as294_3 deposited under accession number ATCC 98444; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone as294_3 deposited under accession number 
ATCC 98444; 

(h) a polynucleotide encoding a mature protein encoded by the cDN A 
insert of clone as294_3 deposited under accession number ATCC 98444; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:2; 

(j) a polynucieodde encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:2 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID IsrO:2; 

(k) a polynucleotide which is an allelic variant of a polynucieodde of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) - a polynucleotide that hybridizes under stringent conditions to any 
"one of the polynucleotides specified in (a)-(j). 

2. The polynucieodde of claim 1 wherein said polynucleotide is operably 
linked to at least one expression control sequence. 
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3. A host cell trcmsformed with the polynucleotide of claim 2. 

4. The host cell of claim 3, wherein said cell is a mammalian cell. 

5- A process for producing a protein encoded by the polynucleotide of claim 
2, which process comprises: ^. 

(a) growing a culture of the host cell of claim 3 in a suitable culture 
medium; and 

(b) purifying said protein from the culture. 

6. A protein produced according to the process of claim 5. 

7. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 

(b) the amino acid sequence of SEQ ID NO:2 from amino acid 1 to 
amino acid 123; 

(c) fragments of the amino add sequence of SEQ ID NO:2 comprising 
eight consecutive amino acids of SEQ ID NO:2; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
as294_3 deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. 

8. The protein of claim 7, wherein said protein comprises the amino acid 
sequence of SEQ ID NO:2. 

9. The protein of claim 7, wherein said protein comprises the amino acid 
sequence of SEQ ID NO:2 from amino acid 1 to amino acid 123. 

10. A composition comprising the protein of claim 7 and a pharmaceutically 
acceptable carrier. - 

11. An isolated gene corresponding to the cDNA sequence of SEQ ID MO:l." 
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12. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:3; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 262 to nucleotide 3096; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 1118 to nucleotide 1527; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone aw92_l deposited under accession 
number ATCC 98444; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDN A insert of clone aw92_l deposited under accession number ATCC 98444; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone aw92_l deposited vmder accession number 
ATCC 98444; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone aw92_l deposited under accession number ATCC 98444; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:4; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:4 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO:4; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

13. A protein comprising an amino add sequence selected from the group 
consisting of : - - 

(a) the amino acid sequence of SEQ ID NO:4; 

(b) the amino add sequence of SEQ ID NO:4 from amino add 287 to 
amino acid 422; 
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(c) fragments of the amino acid sequence of SEQ ID NO:4 comprising 
eight consecutive amino acids of SEQ ID NO:4; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
aw92_l deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. 

14. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:3. 

15. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:5; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 612 to nucleotide 806; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 744 to nucleotide 806; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 1 to nucleotide 794; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bd316_2 deposited under accession 
number ATCC 98444; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone bd316„2 deposited under accession number ATCC 98444; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone bd316_2 deposited under accession number 
ATCC 98444; 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone bd316_2 deposited under accession number ATCC 98444; 

(i) a polynucleotide encoding a protein comprising the amino acid 
* sequence of SEQ ID NO:6; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO: 6 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO:6; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 
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(I) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide that hybridi2es under stringent conditions to any 
one of the polynucleotides specified in (a)-(j). 

16. A protein comprising an amino add sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:6; 

(b) the amino acid sequence of SEQ ID NO:6 from amino acid 1 to 
amino acid 61; 

(c) fragments of the amino acid sequence of SEQ ID NO:6 comprising 
eight consecuhve amino acids of SEQ ID NO:6; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
bd316_2 deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. 

17. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:5. 

18. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:7; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 7 to nucleotide 300; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 1 to nucleotide 363; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bkl30_4 deposited under accession 
number ATCC 98444; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone bkl30_4 deposited under accession number ATCC 98444; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone bkl30_4 deposited under accession number 
ATCC 98444; 
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(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone bkl30_4 deposited under accession number ATCC 98444; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:8; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino add sequence of SEQ ID NO:8 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO:3; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

Q) a polynucleotide that hybridises under stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

19. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:8; 

(b) fragments of the annino acid sequence of SEQ ID NO:8 comprising 
eight consecutive amino acids of SEQ ID NO:8; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
bkl30_4 deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. 

20. An isolated gene corresponding to the cDNA sequence of SEQ ID NO: 7. 

21. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:9; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 52 to nucleotide 1863; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 1219 to nucleotide 1863; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 1099 to nucleotide 1743; 
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(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bvl31_5 deposited under accession 
number ATCC 98444; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone bvl31_5 deposited under accession number ATCC 98444; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone bvl31_5 deposited under accession number 
ATCC 98444; 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone bvl31_5 deposited under accession number ATCC 98444; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:10; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:10 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO:10; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-{j). 

22. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:10; 

(b) the amino acid sequence of SEQ ID NO:10 from amino acid 430 to 
amino acid 564; 

(c) fragments of the amino acid sequence of SEQ ID NO: 10 comprising 
' eight consecutive amino acids of SEQ ID NO:10; and 

id) the amino axiid sequence encoded by the cDNA insert of done 
bvl31_5 deposited under accession number ATCC 98444; 
the protein being substantially free from other mammalian proteins. 

23. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:9, 
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24. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:il; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:ll from nucleotide 67 to nucleotide 690; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:ll from nucleotide 1 to nucleotide 576; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bv227_l deposited under accession 
number ATCC 98444; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone bv227_l deposited under accession number ATCC 98444; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone bv227_l deposited under accession nimnber 
ATCC 98444; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone bv227_l deposited under accession number ATCC 98444; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:12; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:12 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO:12; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(I) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

25. A protein comprising an amino acid sequence selected from the group 
consisting of: ' 

(a) the amino acid sequence of SEQ ID NO:12; 

(b) the amino acid sequence of SEQ ID NO:12 from amino acid 1 to 
amino acid 170; 
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(c) fragments of the amino add sequence of SEQ ID NO: 12 comprising 
eight consecutive amino acids of SEQ ID NO:12; and 

(d) the amino acid sequence encoded by the cDNA insert of done 
bv227_l deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. 

26. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:ll. 

27. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:13; 

(b) a polynucleotide comprising the nudeotide sequence of SEQ ID 
NO:13 from nudeotide 657 to nucleohde 1469; 

(c) a polynucleotide comprising the nudeotide sequence of SEQ ID 
NO:13 from nudeotide 678 to nucleotide 1103; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone cd265_ll deposited under accession 
number ATCC 98444; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone cd265_ll deposited under accession number ATCC 98444; 

(f) a polynucleotide comprising the nudeotide sequence of a mature 
protein coding sequence of clone cd265_ll deposited under accession number 
ATCC 98444; 

(g) a polynucleohde encoding a mature protein encoded by the cDNA 
insert of done cd265_ll deposited under accession number ATCC 98444; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO: 14; 

(i) . a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:14 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO: 14; 

(j) ' a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynudeotide which encodes a spedes homologue of the protein 
of (h) or (i) above ; and 
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(1) a polynucleoride that hybridizes iinder stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

28. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:14; 

(b) the amino acid sequence of SEQ ID NO:14 from amino acid 8 to 
amino acid 149; 

(c) fragments of the amino acid sequence of SEQ ID NO:14 comprising 
eight consecutive amino acids of SEQ ID NO: 14; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
cd265_ll deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. 

29. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:13. 

30. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:15; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 15 from nucleotide 261 to nucleotide 896; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 330 to nucleotide 896; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 1 to nucleotide 515; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone ej265_4 deposited under accession 
numbei: ATCC 98444; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone ej265_4 deposited under accession number ATCC 98444; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone ej265_4 deposited under accession number 
ATCC 98444; 
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(h) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone ej265_4 deposited under accession number ATCC 98444; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO: 16; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:16 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO:16; 

(k) a polynucleotide which is an allelic variamt of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-{j). 

31. A protein comprising an amino acid sequence selected from the group 
cor\sisting of: 

(a) the amino acid sequence of SEQ ID NO:16; 

(b) the amino acid sequence of SEQ ID NO:16 from amino acid 1 to 
amino acid 85; 

(c) fragments of the amino acid sequence of SEQ ID NO:l 6 comprising 
eight consecutive amino acids of SEQ ID NO:16; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
ej265_4 deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. 

32. An isolated gene corresponding to the cDNA sequence of SEQ ID NO: 15. 

33. An isolated polynucleohde selected from the group consisting of; 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:17; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 946 to nucleotide 2232; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 1336 to nucleotide 1853; 
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(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone ey29_8 deposited under accession number 
ATCC 98444; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone ey29_8 deposited under accession number ATCC 98444; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone ey29_8 deposited under accession number ATCC 
98444; 

(g) a polynucleotide encoding a mature protein encoded by the cDN A 
insert of clone ey29_8 deposited under accession number ATCC 98444; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:18; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:18 having biological activity, the fragment 
comprising eight consecutive amino adds of SEQ ID NO: 18; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(I) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-{i). 

34. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:18; 

(b) the amino acid sequence of SEQ ID NO: 18 from amino acid 138 to 
amino acid 302; 

(c) fragments of the amino acid sequence of SEQ ID NO: 18 comprising 
eight consecutive amino acids of SEQ ID NO: 18; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
ey29_8 deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. 

35. An isolated gene corresponding to the cDNA sequence of SEQ ID NO: 17. 
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36. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:19; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:19 from nucleotide 2588 to nucleotide 3439; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 19 from nucleotide 3005 to nucleotide 3502; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone gmll4_10 deposited under accession 
number ATCC 98444; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone gmll4_10 deposited under accession number ATCC 98444; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone gmll4_10 deposited under accession number 
ATCC 98444; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of done gmll4_10 deposited under accession number ATCC 98444; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:20; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:20 having biological activity, the fragment 
comprising eight consecutive amino acids of SEQ ID NO:20; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(I) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

- 37. A protein comprising an amino acid sequence selected from the group 
consisting of: 

<a) the amino acid sequence of SEQ ID NO:20; 

(b) the amino add sequence of SEQ ID NO:20 from amino add 145 to 
amino acid 284; 
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(c) fragments of the amino acid sequence of SEQ ID NO:20 comprising 
eight consecutive amino acids of SEQ ID NO:20; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
gmll4_10 deposited under accession number ATCC 98444; 

the protein being substantially free from other mammalian proteins. 

38. An isolated gene corresponding to the cDNA sequence of SEQ ID NO: 19. 
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